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MASS LABELS 

This mventioii relates to useful confounds for labelling molecules of interest, particularly 
biomolecules such as peptides and proteins. Specifically this invention relates to labelliug 
of analytes for detection by mass spectrometry and associated methods of analysing mass 
labelled analytes by mass spectrometry. 

Various methods of labelling molecules of interest are known in the art, including 
radioactive atoms, fluorescent dyes, luminescent reagents, electron capture reagents and 
ligbt absorbing dyes. Each of theise labelling systems has features which make it suitable 
for certain applications and not others. For reasons of safety, interest in non-radioactive 
labelling systems lead to the wide^ead conmerdal development of fluorescent labelling 
schemes particularly for genetic analysis. Fluorescent labelling schemes permit the 
labelling of a relatively small number of molecules sinndtaneously, typically 4 labels can 
be used simultaneously and possibly iip to eight Ho\^ever the costs of the detection 
apparatus and the difficulties of analysing the resultant signals limit the nunober of labels 
that can be used simultaneously in a fluorescence detection scheme. 

More recently there has been development in the area of mass spectrometry as a method 
of detecting labels that are cleavably attached to their associated molecule of interest In 
many molecular biology applications one needs to be able to perform separations of the 
molecules of interest prior to analysis. These are generally liquid phase separations. Mass 
spectrometry in recent years has developed a number of interfaces for liquid phase 
separations which make mass spectrometry particularly efifective as a detection system for" 
these kinds of applications. Until recenfly Liquid Chromatography Mass Spectrometry 
was used to detect analyte ions or their fragment ions directly, however for many 
applications such as nucleic acid analysis, the structure of the analyte can be determined 
from indirect labelling. This is advantageous particularly with respect to Ibe use of miass 
spectrometry because conq>lex biomolecules such as DNA have conqpl^ mass spectra 
and aire detected with relatively poor sensitivity. Indirect detection means that an 
associated label molecule can be used to identify the original analyte, where the label is 
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designed for sensitive detectian and a siinple mass -spectrum. Siiqple mass spectra mean 
that multiple labels can be used to analyse multiple analytes sinoultaneously. 

PCT/GB98/00127 describes arrays of nucleic acid probes covalently attached to cleavable 
labeb that are detectable by mass spectrometry which identify the sequence of the 
covalently linked nucleic acid probe. The labelled probes of this application have the 
structure Nu-L-M where Nu is a nucleic add covalenfly linked to L, a cleavable linker, 
covalently linked to M, a mass label. Prefenred cleavable linkers in this application cleave 
within the ion source of the mass spectrometer. Preferred mass labels are substituted poly- 
aryl ethers. These application discloses a variety of ionisation methods and analysis by 
quadrupole mass analysers, TOP analysers and magnetic sector instruments as specific 
methods of analysing mass labels by mass spectrometry. 

PCT/Gi394/0167S disclose ligands, and specifically nucleic acids, cleavably linked to 
mass tag molecules. Preferred cleavable linkers are photo-cleavable. These application 
discloses Matrix Assisted Laser Desoiption Ionisation (MALDI) Time of Flight (TOP) 
roBss spectrcmietry as a specific method of analysing mass labels by mass spectrometry. 

PCT/US97/22639 discloses releasable non-volatile mass-label molecules. In preferred 
embodiments these labels comprise polymers, typically biopolymers which are cleavably 
attached to a reactive group or ligand, i.e. a probe. Preferred cleavable linkers appear to 
be chemically or enzymatically cleavable. This appUcation discloses MALDI TOP mass 
spectrometry as a specific method of analysing mass labels by mass spectrometry. 

PCT/US97/01070, PCT/US97/01046, and PCT/US97/01304 disclose hgands, and 
specifically nucleic adds, cleavably liiiked to mass tag molecules. Preferred cleavable 
linkers ^ear to be chemically or photo-cleavable. These application discloses a variety 
of ionisation methods and analysis by quadnqpole noass analysers, TOP analysers and 
magnetic sector instruments as specific methods of analysing mass labels by mass 
spectrometry. 
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None of these prior art applications mention the use of tandem or serial mass anal3^s for 
use in analysing mass labels. 

Gygi et al (Nature Biotechnology 17: 994-999, "Quantitative analysis of complex protein 
mixtures using isotope-coded aBBnity tags" 1999) disclose the use of 'isotope encoded 
aflSnity tags' for the capture of peptides jfrom proteins, to allow protein e3q)ression 
analysis. In this article, the authors describe the use of a biotin linker, whicli is reactive to 
thiols, for liie capture peptides with cysteine in theia A sanple of protein from one 
source is reacted with the biotin linker and cleaved with an endopeptidase. The 
biotinylated cysteine-containing peptides can then be isolated on avidinated beads for 
subsequent analysis by mass spectrometry. Two sanoples can be coiiq)ared quantitatively 
by labelling one sample with the biotin linker and labelling the second saDGple with a 
deuterated form of the biotin linker. Bach peptide in the sanq>les is then represented as a 
pair of peaks in the mass spectrum. Integration of the peaks in the mass spectrum 
corresponding to each tag indicate the relative expression levels of the peptide linked to 
the tags. 

This 'isotope encoding* method has a number of limitations. A first is the reliance on the 
presence of thiols in a protein - many proteins do not have thiols while others have 
several. In a variation on this method, linkers may be designed to react with other side 
chains, such as amines. However, since many proteins contain more than one lysine 
residue, multiple peptides per protein would generally be isolated in this approach. It is 
likely that this would not reduce the complexity of the saiiq)le sufficiently for analysis by 
mass spectrometry. A sample that contains too many species is likely to suffer from 'ion 
suppression', in which certain species ionise preferentially over other species which would 
normally appear in the mass spectrum in a less con:;>lex sanq)le. In general, capturing 
proteins by their side chains is likely to give either too many peptides per protein or 
certain proteins will be missed altogelh^. 

The second limitation of this approach is the method used to coitq)are the expression 
levels of proteins from different samples. Labelling each san9)le with a different isotope 
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variant of the afiSnity tag results in aii additional peak in the mass spectnim for each 
peptide in each san5)le. This means that if two san5)Ies are analysed together there will 
he twice as many peaks in the spectrum- Sinoilarly, if three samples are analysed together, 
file spectrum will he three times more CQn5)lex than for one satcple alone. It is clear that 
this approach will he limited, since the ever increasing numbers of peaks will increase the 
likelihood that two different peptides will have overlapping peaks in the noass spectrum. 

A furthOT limitation, which is reported by the authors of the above paper, is the mobility 
change caused by the tags. The authors report that peptides labelled with the deuterated 
hiotin tag elute shghtly after the same peptide labelled with the undeuterated tag. 

The mass spectra generated for analyte material are very sensitive to contaminants. 
Bssentially, any material introduced into the mass spectrometer that can ionise will ^pear 
in the mass spectrumu This means that for many analyses it is necessary to carefully purify 
the analyte before introducing it mto the mass spectrometer. For the purposes of high 
throughput systems for indirect analysis of analytes through mass labels it would be 
desirable to avoid any umiecessaiy saniple preparation steps. That is to say it would be 
desirable to be able to detect labels in a background of contaminating material and be 
certain that the peak that is detected does in feet correspond to a label. The prior art does 
not disclose methods or conq)ositions that can inprove tihe signal to noise ratio achievable 
in mass spectrometry based detection systems or that can provide confirmation that a mass 
peak ia a spectrum was caused by the presence of a mass label. 

For the purposes of detection of analytes after liquid chromatography or electrophoretic 
separations it is desirable that the labels used, minimally interfere with the separation 
process. If an array of such labels are used, it is desirable that the effect of each member 
of the array on its associated analyte is the same as every othor label. This conflicts to 
some extent with the intention of mass naarkmg which is to generate arrays of labels that 
are resolvable in the Tnagg spectrometer on the basis of their mass. It is disclosed in the 
prior art above that mass labels should preferably be resolved by 4 Daltons to prevent 
interference of isotope peaks &om one label with those of another labeL This naeans that 
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to geaerate 250 distmct mass labels would require labels spread over a range of about 
1000 Daltons and probably more, since it is not trivial to generate large arrays of labels 
separated by exactly 4 Daltons. This range of mass wiQ almost certainly result in noass 
labels that will have a distinct effect on ai^ separation process that precedes detection by 
mass spectrometry. It also has inplications for instrument design, in that as the mass 
range over which a mass spectrometer can detect ions inoreases, the cost of the instrument 
increases. 

It is thus an object of this invention to solve the problems associated with the above prior art, 
and to provide mass labels which can be detected in a background of contamination and 
whose identity as mass labels can be confirmed. Furthermore it is an object of this 
invention to provide arrays of labels which can be resolved in a compressed mass range so 
that the labels do not mterfere as much with separation processes and which can be 
detected easily in a mass spectrometer ihat detects ions ov^ a limited range of mass to 
charge iratios. 

It is also an object of this invention to provide methods of analysing biomolecules which 
. exploit the labels of this invention to maximise throughput, signal to noise ratios and 
sensitivity of such assays, particularly for the analysis of peptides. 

In a first aspect the invention provides a set of two or more mass labels^ each label in the 
set coinprising a mass marker moiety attached via at least one amide bond to a mass 
normalisation moiety, wherein the aggregate mass of each label in the set may be the 
same or dififerent and the mass of the mass marker moiety of each label in the set may be 
the same or different, and wherein in any group .of labels within tiie set having a mass 
marker moiety of a common mass each label has an aggregate mass different from all 
other labels in that groiip, and wh^ein in any group of labels wilhin the set having a 
coitnnon aggregate mass each label has a mass marker moiety having a mass different 
iGrom that of all other mass marker moieties in that group, such that all of tiie mass labels 
in the set are distinguishable from each other by mass spectrometry, and herein the mass 
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marker moiety comprises an amino acid and ih& mass normalisation moiety comprises an 
amino acid. 

The temi mass marker moiety used in tiie present contejct is intended to refer to a moiety 
that is to be detected by mass spectrometry, whilst the term mass nonnalisation moiety 
used in the present context is intended to refer to a moiety that is not necessarily to be 
detected by mass spectrometry, but is present to ensure that a mass label has a desired 
aggregate mass. The number of labels in the set is not especially limited, provided that 
the set comprises a plurality of labels. However, it is preferred if the set con^nises two or 
more, three or more, four or more, or five or more labels. 

The present mvention also provides an array of mass labels, conqnising two or riiore sdts 
of mass labels as defined above, wherein the aggregate mass of each of the mass labels in 
any one set is different firom the aggregate mass of each of the mass labels in every other 
set in the array. The mass marker moiety and the mass ncnmialisation moiety both 
C0n:pnse at least one amino add. However, the moieties may cotrpises fiirCher groitps, if 
desired, such as more anmio acid groups, and/or aryl ether groups. Thus the moieties may 
be modified amino adds, or may be peptides. The masses of the different sets in the array 
may be distinguished by adding iiirther amino add groups to either or both of the moieties 
as required. 

Further provided by the invention is a method of analysis, which method con5)rises 
detecting an analyte by identifying by mass spectrometry a mass label or a combination of 
mass labels unique to the analyte, wherein the mass label is a mass label firom a set or an 
array of mass labels as defined above. 

In certain embodiments of this invention tiie mass tags may conq)rise reactive 
fimctionalities which facilitate the attachment of the mass tags to analyte molecules. The 
tags in this embodiment are preferably of the following form: 



amino addl -axnide bond -amino add 2 -reactive functionality 
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where the mass marker moiety and flie mass normaKsaticm moiety may each be either 
amino add 1 or amino add 2. 

In preferred embodiments of tte inveatian, the array of tags are preferably all chemically 
identical and the masses of the mass normaliisation and mass marker moieties (e.g. amino 
acid 1 and add 2 above) are altered by isotope substitutions. 

In further preferred embodiments of this invention, the tags liiay comprise a sensitivity 
enhancing groxip. The tags are preferably of the form: 

sensitivity enhancing group - amino add 1 - amide bond - amino add 2 - reactive 

functionality 

In this exarople liie sensitivity enhancing groiq) is usually attached to Ihe mass marker 
moiety, since it is intended to increase the sensitivity of the detection of this moiety in the 
inass spectrometer. The reactive fimcticMiality is shown as being present and attached to a 
different moieity than the sensitivity enhancing group. However, the tags need not be 
limited in this way and in some cases conxprise the sensitivity enhancing group without 
the reactive functionality. In other embodiments the sensitivity enhancing group may be 
attached to the same moiety as the reactive functionahty. 

In certain embodiments* of the invention the mass tags comprise an affinity capture 
reagent Preferably, tiie afSnity capture ligand is biotin. The affinity capture ligand 
allows labelled analytes to be separated jfrom unlabelled analytes by capturing them, e.g. 
on an avidinated solid phase. 

hi a further aspect the invention provides a method of analysing a biomolecule or a 
mixture of biomolecules. This method preferably conpises the steps of 
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1. Reacting the biomolecule or mixture of bicmolecules with a mass marker 
according to this inveation; 

2. Optionally separating Hie labelled biomolecule electrophdreticany or 
chromatographically; 

3. Ionising the labelled biomolecule; 

4. Selecting ions of a predetermined mass to charge ratio corresponding to the mass 
to charge ratio of the preferred ions of the labelled biomolecule in a mass analyser; 

5 . Inducing dissociation of these selected ions by collision; 

6. Detecting the collision products to identify collision , product ions that are 
indicative of the mass labels. 

In this embodiment, where the mass tags conqirise an affinity tag, the afOnity tagged 
bicmolecules naay be captured by a counter-ligand to allow labelled bicmolecules to be 
separated from unlabelled biomolecules. This step preferably takes place prior to the 
optimal second step above. 

In certain enibodiments the step of selecting the ions of a predeteimiQed mass to charge 
ratio is perfomied in the first mass analyser of a serial instrument The selected ions are 
then channelled into a separate collision cell where they are collided with a gals or a solid 
surface according to the fourth step of the first aspect of the invention. The collision 
products are then channelled into a further mass analyser of a serial instrument to detect 
collision products according to the fiftii step of the first aspect of this invention. Typical 
serial instruments include triple quadncpole mass spectrometers, tandem sector 
instruments and quadrupole time of flight mass spectrometers. 

In other enibodiments, the step of selecting the ions of a predetenninied mass to charge 
iratio, the step of colliding the selected ions with a gas and the step of detecting the 
collision products are performed in the same zone of the mass spectrometer. This may 
ejSfected in ion tr^ mass analysers and Fourier Transform Ion Cyclotrcm Resonance mass 
spectrometers, for exan^le. 
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Iq another aspect, this mveation provides sets or arrays of mass labelled molecules of the 
form: 

analyte - linker - label 

where label is a mass markesr from a set or array according to this iavecation, the linker is a 
linker as described below and analyte may be any analyte of interest such as a 
biomolecule. One preferred aspect of this embodiment is where the analytes (one, more 
than one or even all the analytes) in the set or array are standard analytes with a known 
mass or with predetermined chromatographic prop^es. Such standards can be 
employed in the nietiiods of the present invention for contparison with unknown analytes, 
for example when analysing the results of a chromatographic separation step. 

This invention describes mass markers that xDsy be readily produced in a peptide 
synthesiser. Indeed, the conapounds used in this invention coixpises peptides and 
modified peptides. Peptide synth^s provides chemical diversity allowing for a wide 
range of naaikers with chosen properties to be produced in an automated fashion. 

The term 'MS/MS' in the context of naass spectrometers refers to mass spectrometers 
capable of selecting ions, subjecting selected ions to Collision Induced Dissociation (CID) 
and subjecting the firagment ions to further analysis. 

The term ^serial instrument' refers to mass spectronoeters capable of MS/MS ia which 
mass analysers are organised in series and each step of flie MS/MS process is performed 
one after the other in linked mass analysers. Typical serial instruments include triple 
quadrupole mass spectrometers, tandem sector instruments and quadmpole time of fight 
mass spectrometers. 

The invention will now be described in further detail by way of exanople only, with 
reference to the accompanying drawings, in which: 
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Figure 1 shows a set of 3 mass tags derived fixm lysine; 

Figure 2 shows a set of S mass tags derived from alanine; 

Figure 3 shows a set of 5 mass tags derived from alanine and tyrosine; 

Figure 4 shows a set of 4 mass tags derived from fluorinated forms of phenylglycine; 

Figure 5 shows a set of 4 mass tags derived from fluorinated forms of phenylglycine and 
phenylalanine; 

Figure 6a shows a set of 2 afiSluty Ugand mass tags derived from methionine with a 
hydrazide fimcdonality for labelling carbohydrates; 

Figure 6b shows a set of 2 afiBnity ligand mass tags derived from methionine with a 
boronic add functionality for labelling carbohydrates; 

Figure 7 shows a set of 2 aflfinily ligand noass tags derived from methionine with a thiol 
ftinctionaUty for labelling dehydroalanine and methyldehydroalanine residues; 

Figure 8 shows a set of 2 afBnity ligand mass tags derived from methionine with a 
irialermide functionality for labelling free thiols; 

Figure 9a shows a synthetic pathway for the preparation of an FMOC protected, 
deuterated methionine residue and figure 9b shows a synthetic pathway for the 
preparation of areactive linker that can act as a sensitivity enhanceo^ 

Figure 10 shows a pair of example peptides derived from different isotopic forms of 
methionine synthesised to demonstrate the features of this inventioo^ 
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Figure 1 1 shows an electrdspray mass spedrum of a mixture of the two peptides shown in 
Figure 10; 

Figure 12 shows an electrospray spectrum of the fragmentation of each of the two 
peptides shown in Figure 1 0; . 

Figure 13 shows a hypothetical fragmentation mechanism that is likely to account for the 
spectra shown in Figures 12 and 14; 

Figure 14 shows an. electrospray spectrum of the fragmentation of a 70:30 mixture of the . 
two peptides shown in Figure 1 0; 

Figure 15 shows a graph displaying flie expected ratios of peptides A and B (Figure 10) 
against observed ratios of peptides A and B found in a series of ESI-MS/MS analyses of 
mixtures of A an B; 

Figures 16a-16c depict proposed fragmentation mechanisms; 

Figures 17a-17d illustrate tags which exploit enhancing cleavage at the cleavable amide 
boixd; 

Figures 18a and 1 8b show the structures of two versions of the TMT mark^s; 

Figure 19a and 19b show typical QD spectra for a peptide labelled with the first 
generation TMT at collision energies of 40V (Figure 19a) and 70V (Figure 19b); 

Figure 20a 20b and 20c show MS and MS/MS spectra for triply charged ions of the / 
pqptide 2 (see Table 7) labelled wifli the first and second generation TMTs; 

Figure 21 shows a typical QD spectra for a peptide peptide 2 in Table 7) labelled with a 
second generation TMT; 
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Figure 22 shows that the charge state of the TMT tagged peptide does not affect the 
g^jpearance of the tag fragments in the CD3 spectra of the labelled peptides; 

Figure 23 shows peptide mixtures with the expected and measured abundmce ratios for 
both the first and second generation tags; 

Figure 24 shows the co-elution of each peptide pair, peptides A and B for each peptide 
from Table 7; 

Figure 25 shows a dynamic range study of TMT peptide pairs 3A/3B, which are present 
in a ratio of 40:60 and have been analysed at dilutions in the raaage from 100 frnole to 
lOppmole; and 

Figures 26a 26b and 26c show the results of a spiking expedment in which peptides pairs 
3A and 3B (500 finol in total, in a ratio of 40:60 respectively) bearing a second generation 
TMT was mixed with a tryptic digest of Bovine SOTJm Albumin (2 pmol). 

Figures 1 to 5 illxxstrate a nuniber of important features of the tags of tbds invention. The 
tags in all of figures 1 to 5 are shown linked to a 'reactive functionality*, which could be a 
linker to an N-hydroxysuccinimide ester for examplo or any of a number other 
possibilities some of which are discussed below. Figures 1, 2 and 4 show that a number 
of tags can be generated by combining different mass modified forms of the same amino 
acid into a series of dipeptides. Figures 3 and 5 show sets of tags, which are created by 
combining differmt amino acids in heterodimers. Figures 1 to 3 illustrate tags, which all 
have the same total mass and which are chemically identical. These tags differ in the 
distribution of isotopes in the molecules, while Figures 4 and 5 which all have the same 
total mass but which are not chemically identical, these tags differ in the distribution of 
fluorine substituents in the tags. 
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Figure 1 will now be discussed in more detail. Figure 1 shows 3 hcmodimers of lysine. 
The lysine has been blocked at the epsilon amino ffoxxps with noyethylsu^honyl chloride. 
The stdphonamida linkage is more resistant to jfragmentatton than a conventional amide 
linkage, so that the capping group will not be lost when the tag is fragmented in a noass 
spectrometer using collision induced dissociation at energies sufficient to cleave the 
conventional backbone amide bond between the pair of modified lysine residues. The 
capping group is iised to inhibit protonation at the epsilon position during ionisation of 
the tags in a mass spectronieter. The capped lysine can be prepared prior to synthesis of 
the mass tags. The epsilon amino group can be selectively modified by coupling the 
amino add with melhylsulphonyl chloride in the presence of copper ions, for example. 
Aitnine and add functionalities at the alpha position can form chelates with various 
divalent cations making the alpha anoino group unreactive. The alpha-amino group of the 
dipeptide has been converted to a guanidino-group to promote protonation at this position 
in the tag during ionisation in a idass spectrometer and to differentiate the mass of tiie 
fi:agmentation product from the second alamne residue and natural alanine residues in 
protein. The goanidination of the alpha-position can be perfomied as the last step of a 
conventional peptide synthesis before deprotection of the peptide and cleavage from the 
resin (Z. Tian and R.W. Roeske, Int J. Peptide Protein Res. 37: 425-429, "Guanidination 
of a peptide side chain amiao group on a solid support", 1991). Different deuterated 
forms of lysine would be used to prepare the three different tags. The total mass of each 
of tiie three tags is the same but the N-tenninal lysine in each tag differs from the other 
two by at least four Daltons. This mass difference is usually sujSident to prevent natural 
isotope peaks from fragmented portioiis of each tag from overlapping in the mass 
spectrum with the isotope peaks of the firagmented portions of olher tags. 

Figure 2 will now be discussed in more detail. Figure 2 shows S homodimers of alaoine. 
Different isotopically substituted forms of alanine would be used to prqpare the five 
different tags. The total mass of each of the five tags is the same but the N-terminal 
alanine in each tag differs from the other four by at least one DaltoiL The alpha amino 
groiQ) of the dipeptide tag has been methylated to differentiate the firagmentation product 
of this amino add from the fi:agmentation product of the second alanine residue and the 
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natural alanine residues in the protein and to promote protonation at this position in Ihe 
tag during ionisation in a mass spectrometer. 

Figure 3 will now be discussed in more detail. Figure 3 shows 5 heterodimers of alanine 
and tyrosine. Different isotopically substituted forms of alanine and tyrosine would be 
used to prepare the five different tags. Tlae total mass of each of the five tags is the same 
but tiie N-terminal alanine in each tag differs firom the other four by at least one dalton. 
The alpha amino group of the dipeptide tag has been methylated to differentiate the 
fi*agmentation product of this amino acid from the fi-agmentation products of natural 
alanine residues in the protein and to promote protonation at this position in the tag during 
ionisation in a mass spectrometer. 

Figure 4 will now be discussed in more detail. Figure 4 shows 4 dimers of phenylglydne. 
Diff^ent fluorine substituted forms of phenylglycane would be used to prepare the 4 
different tags. ^ The total mass of each of the 4 tags is the same but the N-terminal 
phenylglycine in each tag differs from the other 3 tags by the mass of at least one fluorine 
atom. The alpha amino group of the dipeptide tag has been methylated to differentiate the 
fiagmentation product of this amino acid from the fragmentation product of the second 
phenylglycine residue and to promote protonation at this position in the tag during 
ionisation in a mass spectrometer. 

Figure 5 will now be discussed in more detail. Figure 5 shows 4 dimers comprising 
phenylglycine and phenylalaoine. Different fluorine substituted forms of phenyl^ycine 
and phraylalanine would be used to prepare the 4 different tags. The total mass of each 
of the 4 tags is the same but the N-terminal alanine in each tag differs from the oth^ 3 
tags by the mas^ of at least one fluorine atom. The alpha amino group of the dipeptide tag 
has been naethyiated, although this serves only to protect the amino group from side 
reactions and to increase protonation as it is not necessaiy to differentiate the first amino 
add as the firagmentation product without mefhylation would be different from the second 
amino acid residue of the tag peptide. The alpha amino group could be riiodified to 
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promote protonation at fliis position in the tag during idnisation in a mass spectrometer by 
methylation or guaoidination if this is desirable. 

The present mvention will now be described in more detail. In one preferred 
embodiment, the preset invention provides a set of mass labels as defined above, in 
which each label in the set has a mass marker moiety having a common mass and each 
label in the set has a unique aggregate mass. 

In an alternative, more preferred embodiment, each label in the set has a common 
aggregate mass and each label in the set has a mass marker moiety of a imique mass. 

The set of labels need not be limited to the two preferred embodiments described above, 
aiid may for cKmsple concise labels of both types, provided that all labels are 
distniguishablebyiiiass spectrometry, as outlined above- 
It is preferred that, in a set of labels of the second type, each mass marker moiety in tile 
set has a common basic structure and each mass normalisation moiety in the set has a 
common basic structure, and each mass label in the set conqjrises one or more mass 
adjuster moieties, the mass adjxister moieties being attached to or situated within the basic 
structure of the mass marker moiety and/or the basic structure of the mass normalisation 
moiety. In this embodimeat, every mass marker moiety in the set con^rises a dijSerent 
number of mass adjuster moieties and every mass label in the set has the same nuinber of 
ma^ adjuster moieties. 

Throughout this description, by cdimnon basic structure, it is meant that two or more 
moieties share a structure wliich has substantially the same structural skeleton, baclione 
or core. This skeleton or backbone may be for exanq)le comprise one or more amino 
acids. Preferably the skeleton conqxrises a nuinber of amino acids linked by amide bonds. 
However, other units such as aryl ether units may also be present The skeleton or 
backbone may con5>rise substituents* pendent firom it, or atomic or isotopic replacements 
within it, without changing the common basic structure. 
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Typically, a set of niass labels of Hie second type referred to above con^irises niass labels 
with the formula: 

M(A)3rL.X(A)z 

wherein M is the mass normalisation moiety, X is the mass marker moiety, A is a mass 
adjuster moiety, L is the cleavable hnker con^jrising the amide bond, y and z are integers 
of 0 or greater, and y+z is an integer of 1 or greater. Preferably M is a fragmentation 
resistant group, L is a linker that is susceptible to fragmentation on collision with another 
molecule or atom and X is preferably a pre-ionised, firagmentation resistant group. The 
sum of the masses of M and Xis the same for all members of the set Preferably M and X 
have the same basic structure or core structure, this structure being modified by the mass 
adjuster moieties. The mass adjuster moiety ensures that the sum of the masses of M and 
X in is the same for all mass labels in a set, but ensures that each X has a distinct (unique) 
mass. 

The present invention also encompasses arrays of a plurality of sets of roass labels. The 
arrays of mass labels of the present invention are not particularly limited, provided that 
they contain a plurality of sets of mass labels according to the present mvention. It is 
preferred that the arrays comprise two or no^e, toee or more, four or more, or five or 
more sets of mass labels. Preferably each mass label in the array has either of the 
following stmctures: 

(S)x-M(A)3rI^X(A)z 
M(A)y-(S)x-L-X(A)z 

wherein S is a mass series modifying group, M is the mass normalisation moiety, X is the 
mass marker moiety, A is the mass adjuster moiety, L is the cleavable linker comprising 
the amide bond, x is an integer of 0 or greater, y and z are integers of 0 or greater, and yfz 



wo 03/025576 



PCT/GB02/04240 



17 

isanintegerofl or greater. The mass series modiJ^g groiip sepa^ 

sets from each other. This group may be any ^e of groiip, but is preferably 

acid, or aryl ether group. Sets may be separated in mass by comprising a different number 

of amino adds in their moieties than otho: tags from diflferent sets. 

Linker Groups 

In the discussion above and below reference is made to linker groups which may be used 
to comiect molecules of interest to the mass label conipounds of this invention. A variety 
of linkers is known in the art which may be introduced between the mass labels of this 
invention and their covalehtly attached analyte. Some of these linkers may be cleavable. 
Oligo- or poly-ethylene glycols or their derivatives may be used as linkers, such as those 
disclosed in Maskos, U. 8l Southern, BM. Nucleic Acids Research 20: 1679 -1684, 1992. 
Succinic add based linkers are also widely used, althougji these are less prieferred for 
applications involving the labelling of oligonucleotides as they ate generally base labile 
and are thus inconcjiatible with the base mediated de-protection steps used in a number of 
oligonucleotide synthesisers. 

Propargylic alcohol is a bifunctional linker that provides a linkage that is stable under the 
conditions of oligonucleotide synthesis and is a preferred linker for use with this 
invention in relation to oligonucleotide applications. Sinndlarly 6-amiaohexanol is a 
useful bifunctional reagent to link appropriately functionalised molecules and is also a 
preferred linker. 

A variety of known cleavable linker groups may be used in conjunction with the 
compounds of this invention, such as photocleavable linkers. Ortho-nitrobenzyl groups 
are known as photocleavable linkers, particularly 2-nitrobmzyl esters and 
2-nitrobenzylan3ines, which cleave at the benzylamine bond. For a review on cleavable 
linkers see Lloyd-Williams et al, Tetrahedron 49, 11065-11133, 1993, which covers a 
variety of photocleavable and chemically cleavable linkers. 
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WO 00/02895 discloses the vinyl sulpliQne c(MH)ounds as cleavable liricers, wbich are 
also applicable for use with this invention, particularly in 25)plications involving Ihe 
labelling of polypeptides, peptides and amino acids. The content of this application is 
incorporated by reference. 

WO 00/02895 discloses flie use of silicon compounds as linkers that are cleavable by base 
in the gas phase. These linkers are also applicable for use with this iavention, particularly 
in applications iavolving the labelling of oligonucleotides. The content of this application 
is incorporated by reference. 

It has been mentioned above that the mass labels of the present invention may conqirise 
reactive functionalities. Re, to help attach them to analytes. In preferred embodiments of 
the present invention, Re is a reactive functionality or group which allows the mass label 
to be reacted covalentiy to an appropriate fimctional group in an analyte molecule, such 
as, but not limited to, a nucleotide oligonucleotide, polynucleotide, amino add, peptide or 
polypeptide. Re may be attached to the mass labels via a linker which may or may not be 
cleavable. A variety of reactive functionalities may be introduced into the mass labels of 
this invention. 

Table 1 below lists some reactive functionalities that may be reacted with nucleophilic 
fimctionalities which are found in biomolecules to generate a eovalent linkage between 
the two entities. For applications involving synthetic oligonucleotides, primaiy amines or 
thiols are often introduced at the termini of the molecules to peraiit labelling. Any of the 
functionalities listed below could be introduced into the compounds of this invention to 
permit the mass markers to be attadied to a molecule of interest A reactive fimctionality 
can be used to introduce a fvuiher linker groups with a further reactive functionality if that 
is desired. Table 1 is not intended to be exhaustive and the present invention is not 
limited to the use of only the listed functionalities. 
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. Table 1 






1 Nudeophilic Functionality 


Reactive Functionality 


Resultant Linking Group 


-SH 


-S02-CH=CR2 


-S-CaR.2-CH2-S02- 


-NH2 


-S02-CH=CR2 


-N(CR2-CH2-S02-)2 or 
-NH-CR2-CH2-SO2- 


-NH2 




-CO-NH- 


-NH2 


( 7 

\=/ 


-CO-NH- 


-NH2 


-NCO 


-NH-CO-NH- 


-NH2 


-NCS 


-NH-CS-NH- 


-NH2 


-CHO 


-CH2-NH- 


-NH2 


-SO2CI 


-oU2*JNrl- 


-NH2 


-CH=CH- 


-NH-CH2-CH2- 


-OH 


-OP(NCH(CH3)2)2 


.0P(=0)(0)0- 



It should be noted that in applications involving labelling oligonucleotides with the mass 
markers of this invention, some of the reactive fimctionalities above or their resultant 
linking groTq)s might have to be protected prior to introduction into an oligonucleotide 
synthesiser. Preferably unprotected ester, thioether and thioesters, amine and anaide 
bonds are to be avoided, as these are not usually stable in an oligonucleotide synthesiser. 
A wide variety of protective gfoisps is kaown in the art which can be used to protect 
linkages from unwanted side reactions. 

In the discussion below reference is made to "charge carrying functionalities" and 
solubilising groups. These groups may be introduced into the xdbss labels such as in the 
mass markers of the invention to promote ionisation and solubility. The choice of 
markers is dependent on whether positive or negative ion detection is to be used. Table 2 



wo 03/025576 



20 



PCT/GB02/04240 



below lists some functionalities that inay be introduced into mass markers to promote 
either positive or negative ionisatioiL The table is not intended as an exhaustive list, and 
the present invention is not limited to the use of only the listed function^ties. 



Table 2 




Positive Ion Mode 


Negative Ion Mode 


.NH2 


-SO3- 1 


-NR2 


-PO4- 




-PO3- 




-CO2- 






-SR2+ 





WO 00/02893 discloses the use of metal-ion binding moieties such as crown-ethers or 
porphyrins for the purpose of improving the ionisation of mass markers. These moieties 
are also be applicable for use with the mass markers of this invention. 



The conoponents of the mass markers of this invention are preferably fragmentation 
resistant so that the site of fragmentation of the markers can be controlled by the 
introduction of a linkage that is easily broken by Collision Induced Dissociation (C3D). 
Aryl ethers are an example of a class of fragmentation resistant coiiq)Ounds that may be 
used in this invention. These coi3:5)Ounds are also chemically inert and thermally stable. 
WO 99/32501 discusses the use of poly-ethers in mass spectrometry in greater detail and 
tiie content of this apphcation is incorporated by reference. 

In the past, the general method for the synthesis of aryl ethers was based on the Ullmann 
coupling of arylbromides with phenols in the presence of copper powder at about 200°C 
(representative reference: H. Stetter, G. Duve, Chemische Berichte 87 (1954) 1699). 
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Milder mefliods for the synthesis of aryl eihers have been developed using a different 
metal catalyst but the reaction ten5)erature is still between 100 and 120**C (M lyoda, M. 
Sakaitani, H. Otsuka, M. Oda, tetrahedron Letters 26 (1985) 477). This is a preferred 
route for the production of poly-ether mass labels. See synthesis of FT?? given in the 
exaii5)les below. A recently published method provides a most preferred route for the 
generation of poly-ether mass labels as it is carried out under much milder conditions than 
the earlier methods (D. E. Evans, J. L. Katz, T. R- West, Tetrahedron Lett. 39 (1998) 
2937). 

The present invention also provides a set of two or more probes, each probe in the set 
being different and bemg attached to a unique mass label or a unique combination of mass 
labels, from a set or an array of mass labels as defbaed as defined above. 

Further provided is an array of probes comprising two or more sets of probes, wherein 
each probe in any one set is attached to a unique mass label, or a unique combination of 
mass labels, firom a set of mass labels as defined above, and wherein the probes in any one 
set are attached to mass labels firom the same set of mass labels, and each set of probes is 
attached to mass labels firom unique sets of mass labels firom an array of mass labels as 
defined above. 

In one embodiment, each probe is preferably attached to a unique combination of mass 
labels, each coiribination being distinguished by the presence or absence of each mass 
label in the set of mfl^g labels and/or the quantity of each mass label attached to the probe. 
This is termed the "noixing mode" of the present invention, since the probes may be 
attached to a niixture of mass labels. 

In the above aspects, the nature of the probe is not particularly limited. However, 
preferably each probe comprises a biomolecule. Any biomolecule can be employed, but 
the biomolecule is preferably selected firom a DNA, an BNA, an oligonucleotide, a 
nucleic acid base, a peptide, a polypeptide, a protein and an asnino add. 
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la one preferred embodiment, this invention provides sets and arrays of mass labelled 
analytes, such as nucleotides, oligonucleotides and polynucleotides, of the form: 

analyte- linker -label 

Wherein the linker is a linker as defined above, and label is a mass label from any of Ute 
sets and arrays defined above. 

In the above aspect, the nature of the analyte is not particularly limited. However, 
preferably each analyte con^nises a biomolecule. Any biomolecule can be eirployed, but 
the biomolecule is preferably selected fix>m a DNA, an RNA, an oligonucleotide, a 
nucleic acid base, a peptide, a polypeptide, a protein and an aiiiino add. 

In one embodiment, each analyte is preferably attached to a unique coixibination of mass 
labels, each combination being distinguished by the presence or absence of each mass 
label in the set of mass labels and/or the quantity of each mass label attached to the probe. 
As mentioned above, this is termed the "mixing mode" of Ihe present invention, since the 
inrobes may be attached to a mixture of mass labels. 

As mentioned above, the present invention provides a method of analysis, which method 
cornprises detecting an analyte by identifying by mass spectrometry a mass label or a 
combination of mass labels unique to the analyte, wherein the mass label is a mass label 
firom a set or an array of mass labels as defined above. The type of method is not 
particularly limited, provided that the method benefits firom the use of the mass labels of 
the present invention to identify an analyte. The method may be, for example, a method 
of sequencing nucleic add or a method of profiling the expression of one or more genes 
by detecting quantities of protein in a sanople. The method is especially advantageous, 
since it can be used to readily analyse a plurality of analytes simultaneously. However, 
the method also has advantages for analysing single analytes individually, since using the 
present mass labels, mass spectra which are cleaner tiian conventional spectra are 
produced, making the method accurate and sensitive. 
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In a fiftther prejEated embodiment, the present invention provides a method which method 
coniprises: 

(a) contacting one or more analytes with a set of probes, or an array of probes, 
each probe in the set or array being specific to at least one analytei wherein Ihe probes are 
as defined above, 

(b) identifying an analyte, by detecting the probe q)ecij5c to that analyte. 

In this embodiment it is preferred that the mass label is cleaved from the probe prior to 
detecting the mass label by mass spectrometry* 

The nature of the methods of this particular embodiment is not especially limited 
However, it is preferred that the method conpises contactmg one or more nucleic acids 
with a set of hybridisation probes. The set of hybridisation probes typically con^mses a 
set of up to 256 4-mers, each probe in the set having a different combination of nucleic 
acid bases. This method may be suitable for identifying Ihe presence of taarget nucleic 
acids, or alternatively can be used in a stepwise method of primer extension sequencing of 
one or more nucleic add templates. 

The mass labels of the present invention are particularly suitable for use in methods of 
2-dimensidnal analysis, primarily due to the large number of labels that can be 
simultaneously distinguished. The labels may thus be used in a method of 2-dimensional 
gel electrophoresis, or in a method of 2-dimensional mass spectroinetry. 

Peptide Synthesis 

the syafhesis of many examples of the peptide mass tags of this invention will be possible 
using conventional peptide synthesis me&ods and commercially available reagents. 
Modified amino acids that are not commercially available are also contenqilated for the . 
synthesis of fiirther peptide mass tags. 
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Modem peptide synthesis is typically carried out (m solid phase sqjports in automated 
synthesiser instruments, which deliver all the necessary reagents for each step of a peptide 
synthesis to the solid support and remove spent leagents and unreacted excess reagents at 
the end of each step in the cycle. Solid phase peptide synthesis is, however, often 
performed manually, particularly when specialist reagents are being tested for^the first 
time. In essence peptide synthesis involves the addition of N-protected amino acids to the 
solid siq>port. The peptide is normally synthesised with the C-tenninal carboxyl group of 
the peptide attached to the support, and the sequence of the peptide is built from the C- 
terminal amino add to the N-terminal amino acid. The C-terminal amino acid is coupled 
to the support by a cleavable linkage. The N-protected alpha amino group of each amino 
add is deprotected to allow coupling of the carboxyl group of the next amino acid to the 
growing peptide on the solid support For most purposes, peptide synthesis is performed 
by one of two different synthetic procedures, vMch are distinguished by the conditions 
needed to remove flie N-protecting group. The tert-butyloxycarbonyl (t-BOC) gcoyxp is 
cleaved by mildly addic conditions, e.g. triQubroacetic add in dichloromethane, while the 
fluorenylmethoxycarbonyl (FMOC) group is cleavedby mildly basic conditions, e.g. 20% 
piperidine in dimethylformamide. Reactive side chains in amino adds also need 
protection during cycles of amide bond formation. These side chains include the epsilon 
amino group of lysine, the guanidino side-chain of arginine, the thiol functionality of 
cysteine, the hydroxyl functionalities of serine, threonine and tyrosine, the indole ring of 
tryptophan and the imidazole ring of histidine. The choice of protective groups used for 
side-chain protection is determined by the cleavage conditions of the alpha-amino 
protection groups, as the side-chain protection groups must be resistant to the 
deprotection conditions used to remove the alpha-amino protection groups. A first 
protective group is said to be 'orthogonal' to a second protective gcoixp if the first 
protective gcovp is resistant to deprotection under the conditions used for the deprotection 
of the second protective group and if the deprotection conditions of the first protecting 
groiqp do not cause deprotection of the second protecting group. 

Examples of side-chain protection groups con5)atible with FMOC syntheses are shown in 
Table 3. 
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Tables 



Side Chain 


Protective Group 


Epsilon ammo group of lysine 


t-BOC group 


Guamdino-functionality of arginine 


Nitro group or 2,2,5,7,8- 
pentamethylchroman-6-sulplionyl group 


Inudazole ring of histidine 


t-Trityl group, tc -benzyloxymefhyl (Bom) 
group. 


Hydroxy! functionalities of serine, 
threonine and tyrosine 


Tert-butyl group 


Indole ring of tryptophan 


t-BOC 


Thiol functionality of cysteine 


trityl or benzyl group 


Amide functionalities of glutamine and 
asparagine 


Not usually necessary but Trityl group can 
be used for oxBxnpie. 


Carboxylic add functionalities of glutamic 
acid aiad aspartic acid. 


Tert-butyl group 


Thioether of methionine 


Sometimes protected as sulphoxide 



Other side-chain protective groups that are orthogonal to FMOC protection will be loiown 
to one of ordinary skill in the art and may be applied with this invention (see for oxsmple 
Fields GB. & Noble ILL., Lit J Pept Protein Res 35(3): 161-214, "Solid phase peptide 
synthesis utilizmg 9-fluorenylmethoxycarbonyl amino acids." 1990). 



Protection ffoups for reactive side-chain functionalities conorpatible with t-BOC syndesis 
are shown below in Table 4. 
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Table 4 



Side Chain 


Protective Group 


Epsilon amino group of lysine 


Benzyloxycarbonyl (Z) group 


Guamdino-functionality of argmine 


Not usually necessary but nitration can be 
used 


Imidazole ring of histidine 


71 - benzyloxymethyl (Bom) group. 


Hydroxyl fimctionalifies of serine, 
threonine and tyrosine 


DwZtyi group 


Hydroxyl fimctionality of tyrosine 


2-Bromobenzyloxycaxbonyl group 


Indole ring of tryptophan 


Not usually necessary 


Thiol functionality of cysteine 


Benzyl group 


Amide functionalities of glutamine and 
asparagine 


Not usually necessary 


Carboxylic add functionalities of glutamic 
acid and aspartic acid. 


BenTyl ester groiq) 


Thioellier of methionine 


Sometimes protected.as sulphoxide 



Again, the practitioner of ordinary fiVill in the art will be aware of other protective groups 
for use with reactive side chains that are orthogonal to t-BOC alpha amino protection. 
Various different solid supports and resins are commercially available for peptide 
synthesis using either the FMOC or t-BOC procedures (for a review of soHd supports see 
Meldal. M., Methods En2ymol 289: 83-104, *Troperties of solid supports." 1997). 



Mass modified amino acids* 

A variety of amino adds can be used in tiie mass noarker moiety and the mass 
nomDalisation moiety. Neutral amino adds are preferred in the wbss normalisation 
nooiety and charged amino adds are preferred in the noass marker moieties (since this 
&dlitates ionisation and increases sensitivity) e.g. in the position marked amino add 1 
and atrdno add 2 in Ihe first amd fourth embodiments of this inventioit A nuniber of 
conomerdally available isotopically mass modified amino adds are shown in Table 5 
below. Any combination of 1, 2 ,3, or 4 or more amiao adds from this list are preferred 
in each of the moieties according to the present invention. 
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tables 



Amino acid 


Isotope Forms 


Alanine 


CH3CH(NH2)^'C02H, CH3CD(NH2)C02H, 
CH3''CHO'NH2)C02H, CD3CH(NH2)C02H, CD3CD(NH2)C02H, 
GD3CH(NH2}"C02H, CD3"CH(NH2)C02^, 
^^CH3"CH(^T«JH2)"C02H 


Arginine 


rf^m),CNHCH,CH2CH(NH2)C02Hl* 


Asparagine 


H2N"COCH2CH(NH2)C02H. H2N"CO"CH2"CH(NH2)"C02H, 
H2^^COCH2CH(NH2)C02H,H2'^COCH2CHC'^2)C02H, 


Aspartic Acid 


H02"CCH2CH(NH2)C02H, H02C"CH2CH(NH2)C02H. 
H02CCH2CH(NH2)"C02H, HO2"CCH2CHCNH2)"C02H, 
HO2CCH2^^CH(NH2)"C02H, HO2^^C"CH2CH(NH2)C02H, 
HO2"C"CH2"CH(NH2)"C02H, H02CCD2CDCNH2)C02H, 
H02CCH2CB[(^^2)C02H. H02CCH2CH('^2)"C02H 


Cysteine 


Not available 


Glutamic Acid 


H02CCH2CH2CH(NH2)"C02H, H02CCH2CH2"CH(1SIH2)C02H, 
H02CCH2"CH2CH(NH2)C02H, HO2C"CH2CH2CH(lSIH2)C02H, 
H02"CCH2CH2CH(NH2)C02H, 
H02"C"CH2"ai2^^CH(NH2)"C02H, 

H02CCD2CH2CH(NH2)C02H, HO2CCD2CX>2CD(NH2)C02H, 
H02"C"CH2"CH2"CH('^2)"C02H 


Glutamiae 


H2NC0CH2CH2CH(1S1H2)"C02H, 

H2N"COCH2CH2CH(NH2)C02H, 

H2NCOCD2CD2CaD(NH2)C02H, 

H2^^COCH2CH2CHmH2)C02H, 

H2NCOCH2CH2CH('TOT[2)C02H, 

H2^^COCH2CH2CH0^2)C02H, 


Glycine 


HzNGHz^COjH, H2N"CH2C02H, H2N"CH2"C02H, 
H2NCD2CO2H, H2^^CH2C02H, H2^^"CH2C02H, 
H2^^CH2"C02H, Hz^'N^CHj^COjH 


Histidine 


(CH)2N2CCH2CH(NH2)"C02H, (CH)iN2CCH2CaaC'NH2)C02H, 
(CH)2*^2CCH2CH(NH2)C02H 


Isoleucine 


Not available 


Leucine 


(CH3)2CHCH2CH(NH2)"C02H, (CH3)2CHCH2"CH(NH2)C02H, 
(CH3)2CHCH2"CH(NH2)^^C02H, (CH3)2CHCH2CD(NH2)C02H, 
(CH3)2CHCD2C3)(NH2)C02jH, (CD3)(CH3)CHCH2CH(NH2) 
CO2H, (a>3)2CDCH2CHfl«JH2)C02H, 
(CD3)2CDCT)2CD(NH2)C02H, (CH3)2CHCH2CaH(^Tra2)C02EI, 
(CH3)2CHCH2CH(*^2)"caH 


Lysine 


H2NC3I2CH2C3I2CH2CHCNH2) CO2H, 

H2NCH2CH2CH2CH2"CH(NH2)C02H, 

H2N"CH2CH2CH2C3l2CH(NH2)C02H, 

H2NCH2CH2CH2CH2"CH(NH2)"C02H, 

H2NCH2CD2CD2CH2CH(NH2)C02H, 

H2NCD2CD2CX>2CD2CH(NH2)C02H, 
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H2NCH2CH2CH2CH2CHC'NH2)CX)2H, 
H2^*NGH2CH2CH2CH2CH(NH2)CX)2H, 
H,'^"CH2CH2CH,CH,.CH(NH2)C02H 


MetMcmine 


CH3SCH2CH2CH(NH2)^^002H, CH3SCH2CH2*^CH(NH2)C02H, 
"CH3SCH2CH2CH(NH2)C02H, CH3SCH2CH2CDg>IH2)C02H, 
CD3SCH2CH2CHCISIH2)C02H, CH3SCH2CH2CH(^T«l2)C02H, 
"C3D,Sai2CH2aJ(NH2)C02H,(H3SCH2CH2"aif^'NH2)C02H 


Phenylalanine 


C6HsCH2CH(NH2)"C02H. C6H5CH2"CH(NH2)C02H, 
>^C6H5CH2CH(NH2)C02H, C6H5CH2CD(NH2)C02H, 
C6H5C3D2CH(NH2)CX)2H, C6D5CH2CH(NH2)C02H, 
C«D,CD2CD(]S!Hi)C02H. C6H5eHjCH('^2)CO,H 


Proline 


B D N 


Serine 


HOCH2CH(NH2)"C02H, BOCai'^CSmSLdCOzB, 
HO"CH2CH(NH2)CX)2H. HOCH2CH(TSIH2)C02H, 
HOCH2"CH('^2)C02H 


Threoiune 


UJl3UJtllUxlJUJtltXNJtl2; V^U2Xl 


Tryptophan 


D 

D ' 


Tyrosine 


HO(C6H4)CH2CH(NH2)"C02H, HO(C*H4)CH2"CH(NH2)C02H, 

HO(G5BU)"CH2CH(NH2)C02H, 

HO(C6H4)"CH2"C3I(NH2)"C02H, 

HO("C6H4)CH2CH(NH2)C02H, 

HO("C6H4)''CH2''CH(NH2)"C02H, 

HO(C6H4)CD2CH(NH2)C02H, HO(C6D2H2)CH2CHg«l2)C02H, 
HO(C6D4)CH2CH(NH2)C02H, HO(C6H4)CH2C3I('^2)C02H, 
H"0(C6H4)CH2CH(NH2)C02H. H"0(C«H[4)CH2CH(NH2)C02H, 
H0(C6H4)CH2"CHC^2)C02H, 
H0("C^)'^CH,"CHC''NH2)"C02H 


Valine 


(CH3)2CHCH(NH2)"C02H, (CH3)2CH"CH(NH2)C02H, 
(CH3)2CHCDCNH2)C02H, (CD3)2CDCDCNH2)C02H, 
(C3I,)2CHCH(^'NH2)C02H 
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For many of the above ainino adds, both liie D- and L- forms are available (from 
ISOTEC Inc., Mianrisburg, Ohio for example), either of wMch may be used in the 
preparation of the tags of this inveation Mixtures of D and L forms are also available but 
are less preferred if the tags of this invention are to be used in chromatographic 
separations. For some, FMOC or t-BOC protected derivatives are also available. Mass 
modified amino acids based on substitution of deuterium for hydrogen and on substitution 
of ^^C and isotopes for ^^C and isotopes are also available and are equally 
^plicable for the synthesis of the tags of this invention. Various amino acids that are not 
typically found in peptides may also be used in the tags of this invention, for exaraple 
deuterated fdraas of amino-bu^c acid are commercially available. For the purposes of 
this invention non-radioactive, stable isotc^)es are preferred for safety reasons but there is 
no necessary limitation to stable isotopes. 

Fluoiinated derivatives of a number of amino adds are also available. Some of the 
coimnerdally available fluorinated amino acids are shown in Table 6 below. 



Table 6 



Amino acid 


Fluorinated Forms 


Glutamic Acid 


H02CCFHCH,.CH(NHj)CX)2H 


Leucme 


(CH3)(CF3)CHCH2CH(NH2)CX)2H 


Phenylalanine 


C6FH4CH2CH(NH2)C02H, C6F2H3CH2CH(NH2)C02H, 
CfiFgHiCHiCHrNHjKOiH 


FhenylglyCTxe 


C^CH(NH2)C02H, C6F2H3CH(NDH2)C02lI, 
C;5F3H2CH(NHi)C02H 


Valine 


(CH3)2C3FCHCNH2)C02H 



For most of the above fluorinated amino acids, the reagents are available as mixtures of D 
and L forms. In general, fluorinated variants of amino adds are less preferred than 
isotope substituted variants. The fluorinated conipounds can be used to generate a range 
of mass tags with the same mass but each tag will be chemically diflBsrent, which means 
that their behaviour in the mass spectrometer will vary more than isotope substitute tags. 
Moreover, the tags will not have identical chromatogr^hic properties if the tags are to be 
used in chromatographic separations. 
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Reactive Functionalities 

ia some aspects of this invention, as already esplained, the ixiass tags of the invention 
coniprise a reactive functionality. In the sinq>lest embodiments this may be an N- 
hydroxysucdniimde ester introduced by activation of the C-terminus of the tag peptides 
of this invmtion. In conventional peptide synthesis, this activation step would have to 
take place after the peptide mass tag has been cleaved from the solid suj^ort used for its 
synthesis. An N-hydroxysuccinimide activated peptide mass tag could also be reacted 
with hydrazine to give a hydrazide reactive functionality, which can be used to label 
periodate oxidised sugar moieties, for exan5)le. Amino-groups or thiols can be used as 
reactive functionalities in some applications and these may be introduced by adding lysine 
or cysteine after amino acid 2 of the tag peptide. Lysine can be used to couple tags to free 
carboTQrl functionahties using a carbodiinnde as a coixplrng reagent Lysine can also be 
used as the starting point for Ihe introduction of other reactive functionalities into the tag 
peptides of this invention. The thiol-reactive maleimide functionality can be introduced 
by reaction of Ihe lysine epsilon annno group with maleic anhydride. The cysteine thiol 
group can be used as the startmg point for the synthesis of a variety of alkenyl sulphone 
compounds, which are useful protein labelling reagents that react with thiols and amines. 
Compounds such as aminohexanoic acid can be used to provide a spacer between the 
mass modified amino acids and the reactive fiinctionaUty. 

Affinity Capture Ligands 

In certain embodiments of the first aspect of this invention the mass markers cori:5)rise an 
afBtnity capture ligand. Affinity capture hgands are ligands, which have highly specific 
binding partners. These binding partners allow molecules tagged with the ligand to be 
selectively c^tured by the binding partner. Preferably a solid support is derivitised with 
the binding partner so that afBnity ligand tagged molecules can be selectively captured 
onto the soUd phase support A preferred afBnity capture ligand is biotin, which can be 
introduced into the peptide mass tags of this invention by standard methods known in the 
art In particular a lysine residue may be incorporated after amino acid 2 through which 
an amine-reactive biotin can be linked to the peptide mass tags ( see for exanq)le Geahlen 
ILL. et al.. Anal Biochem 202(1): 68-67, ''A gmeral method for preparation of peptides 
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biotmylated at the carboxy tenmnus." 1992; Sawutz D.G. et al., Peptides 12(5): 1019- 
1012, "Synthesis and molecular characterizatian of a biotinylated analog of 
ILys]hradykmm." 1991; Natarajan S. et al., Int J Pept Protein Res 40(6): 567-567, "Site- 
specific biotmylation. A novel approach and its application to endothelin-1 analogs and 
PTH-analog.", 1992). Iniinobiotni is also applicable. A variety of avidin couirter-ligands 
for biotin are available, which include monomelic and tetrameric avidin and streptavidin, 
all of which are available on a number of solid si^ports. 

Other affinity capture ligands include digoxigenin, fluorescein, nitrophenyl moieties and a 
numbier of peptide epitopes, such as the c-myc epitope, for which selective monoclonal 
antibodies exist as counter-ligands. Metal ion binding Ugands such as hexahistidine. 
Which readily binds Ni^"*" ions, are also applicable. Chromatographic resins, which present 
iminodiacetic acid chelated Ni^ ions are commercially available, for exaniple. These 
immobilised nickel columns may be used to capture peptide mass ta^, which conoprise 
oligbmeric histidine. As a further alternative, an afiSnity c^ture functionality may be 
selectively reactive with an appropriately derivitised solid phase support Boronic acid, 
for exan5)le, is known to selectively react with vicinal ds-diols and chemically similar 
ligands, such as saKcylhydroxamic add. Reagents conspiising boronic add have been 
developed for protein capture onto soUd supports derivitised with salicylhydroxanaic acid 
(Stolowitz ML., et al., Bioconjug Chem 12(2): 229-239, 'Thenylboronic Add- 
Salicylhydroxamic Acid Bioconjugates. 1. A Novel Boronic Add Complex for Protein 
Lnmobilization.** 2001; Wiley J.P. et al., Bioconjug Chem 12(2): 240-250, 
*Thenylboromc Acid-Salicylhydroxamic Add Bioconjugates. 2. Polyvalent 
Immobilization of Protein Ligands for AfBnity Chromatography." 2001, Prolinx, Inc, 
Washington State, USA). It is antidpated tiiat it should be relatively sin5>le to link a 
phenylboronic add functionality to a peptide mass tag according to this invention to 
generate capture reagents that can be captured by selective chemical reactions. The use of 
this sort of chemistry would not be directly con5)atible with biomolecules bearing vicinal 
cis-diol-containing sugars, however these sorts of sugars could be blocked with, 
phenylboronic add or related reagents prior to ireaction with boronic add derivitised 
peptide noass tag reagents. 
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Mass Spec Sensitivity Enhancing Groups and Mass Diffferentiation 
Jn preferred einboditQents of the jBrst and fourth aspects of this invention the peptide naass 
tags conqmse Sensitivity Enhancing Groups. Figures 1 to 5 illustrate the use of 
methylation and guanidination as methods of irrproving sensitivity. In addition, these 
Sensitivity Enhancing Groi5)s can differratiate the fragmentation products of the N- 
tenninal amino acid from the fragmentation products of the second amino add in the 
peptide tag and natural amino acid residues in the protein, if this is the same as the jHrst 
amino acid. The sensitivity enhancing group can also distinguish Ihe fragmentation 
products of the N-terminal amino acid of tiie peptide mass tag from the fragmentation 
products of natural ainino acids when.the tags of this invention aire used to label pq)tides 
and proteins. The guanidind group and the tertiary ainino group are both useftil 
Sensitivity Enhancing Groups for electrospray mass spectrometry. 

Various other methods for derivatising peptides have been also been developed These 
include ihe use of quaternary ammonium derivatives, quaternary phosphonium derivatives 
and pyridyl derivatives for positive ion mass spectrometry. Halogenated coiqpounds, 
particularly halogenated aromatic compounds are well known electrophdres, i.e. they pick 
up thermal electrons very easily. A variety of derivatisation reagents based on fluorinated 
aromatic compounds (Bian N. et al.. Rapid Commun Mass Spectrom 11(16): 1781-1784, 
'^Detection via laser desorptidn and n:mss spectrometry of multiplex electrophore-labelled 
albumin." 1997) have been developed for electron capture detection, which is a highly 
sensitive idnisation and detection process that can be used with negative ion mass 
spectrometry (Abdel-Baky S. & Giese ILW., Anal Chem 63(24):2986-2989, '*Gas 
chromatography/electron csqpture negative-ion mass spectroncietry at the zeptomole level." 
1991). A fluorinated aromatic group could also be used as a sensitivity enhancing group. 
Aromatic sulphonic acids have also been used for inoproving sensitivity in negative ion 
mass spectrometry. 



Each type of Sensitivity Enhancing Group has diflFerent benefits, which depend on the 
method of ionisation used and on the methods of mass analysis used. The mechanism by 
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winch sensitivity is enhanced may also be different for each type of gro\q>. Some 
derivitisation mefhods increase basicity and thus promote protonation and charge 
localisation, while other mefhods increase surfece activity of the tagged pq)tides, which 
in5)roves sensitivity in surface desdiption techniques like Matrix Assisted Laser 
Desoiption lonisatibn (MALDI) and Fast Atom Bombardment (FAB). Negative ion mass 
spectrometry is often more sensitive because there is less background noise. Charge 
derivitisation can also change the fragmentation products of derivatised peptides, when 
collision induced dissociation is used. In particular sonie derivatisation techniques 
simplify fragmentation patterns, which is highly advantageous. The choice of Sensitivity 
Enhancing Group is deteimined by the mass spectrometric techniques that will be 
employed (for a review see Roth et al., Mass Spectrometry Reviews 17:255-274, "Charge 
derivatisation of peptides for analysis by mass spectrometry", 1998). For the purposes of 
this invention all of the known derivatisation techniques could be used with the peptide 
mass tags of this invention. The published protocols could be used without modification 
to derivitise the peptide mass tags of this invention after solid phase peptide synthesis or 
the protocols could be readily adapted for xise during solid phase synthesis if desired 

Analysis of peptides by mass spectrometry 

The essential features of a mass spectrometer are as follows: 

Met System Ion Source -> Mass Analyser -> Ion Detector -> Data Capture System 

There are preferred inlet systenos, ion sources and" mass analysers for the purposes of 
analysing peptides. 

Inlet Systems 

In the second aspect of this invention a chromatognqphic or electrophoretic separation is 
preferred to reduce the conplexity of the san5)le prior to analysis by mass spectrometry. 
A variety of mass spectrometry techniques are con^atible with separation technologies 
particularly capillary zone electrophoresis and High Performance Liquid Chromatography 
(HPLC). The choice of ionisation source is limited to some extent if a separation is 
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required as ionisation teicliniques such as MALDI and FAB (discussed below) which 
ablate material from a solid surface are less suited to chromatographic separations. For 
most purposes, it has been very costly to link a chromatogr2q)hic separation in-line with 
mass spectromeliic analysis by one of these techniques, DyMcnic FAB and ionisation 
techniques based on spraying such as electrospray, thennospray and APCI are all readily 
compatible with in-line chromatographic separations and equipment to perform such 
Uquid chromatography mass spectrometry analysis is commercially available, 

Ionisation techniques 

For many biological mass spectrometry applications so called 'soff ionisation techniques 
are used. Ihese allow large molecules such as proteins and nucleic acids to be ionised 
essentially intact The liquid phase techniques allow large biomolecules to enter the mass 
spectrometer in solutions with mild pH and at low concentrations. A nimiber of 
techniques are appropriate for use with this invention including but not limited to 
Electrospray Ionisation Mass Spectrometry OBSI-MS), Fast Atom Bombardment (FAB), 
Matrix Assisted Laser Desorption Ionisation Mass Spectrometry (MALDI MS) and 
Atmospheric Pressure Chemical Ionisation Mass Spectrometry (APCI-MS). 

Electrospray Ionisation 

Electrospray ionisation requires that the dilute solution of the analyte molecule is 
'atomised' into the spectrometer, i.e. rajected as a fine spray. The solution is, for example, 
sprayed from the tip of a charged needle in a stream of dry nitrogen and an electrostatic 
field The mechanism of ionisation is not fully understood but is thought to work broadly 
as follows. In a stream of nitrogen the solvent is evaporated With a small droplet, this 
results in concentration of the analyte molecule. Given that most bionsolecules have a net 
charge this increases the electrostatic repulsion of the dissolved molecule. As ev^oration. 
continues this repulsion ultunately becomes greater than the sur&ce tension of the droplet 
and the droplet disintegrates into smaller droplets. This process is sometimes referred to 
as a 'Coulombic explosion*. The electrostatic field helps to further overcome the sur&ce 
tension of the droplets and assists in tiie spraying process. The evaporation continues 
from the smallCT droplets which, in turn, e:Kplode iteratively until essmtially the 
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bidznolecules are in Hie v^our phase, as is all the solvent This technique is of particular 
in5)ortance in the use of mass labels in that ttie technique imparts a relatively small 
amount of energy to ions in the ionisation process and the energy distribution wiflun a 
population tends to fall in a narrower range when con^ared with other techniques. The 
ions are accelerated out of the ionisation chamber by the use of electric fields that are set 
iq) by appropriately positioned electrodes. The polarity of fbe fields may be altered to 
extract either negative or positive ions. The potential difference between these electrodes 
detennines whether positive or negative ions pass into the mass analyser and also the 
kinetic energy with which these ions enter the mass spectrometa:. This is of significance 
when considerijag fragnientation of ions in the nmss spectrometer. The more energy 
iii5)arted to a population of ions the more Kkely it is that fragmentation will occur through 
collision of analyte molecules wifli the bath gas present in the source. By adjusting the 
electric field used to accelerate ions from the ionisation chamber it is possible to control 
the fragmentation of idns. This is advantageous when fragmentation of ions is to be used 
as a means of removing tags from a labelled biomolecule. Electrospray ionisation is 
particularly advantageous as it can be used in-line with liquid chroinatogn5)hy, referred to 
as Liquid Chromatography Mass Spectrometry (LC-MS). 

Matrix Assisted Laser Desorption Ionisation (MALDl) 

MALDI requires that the biomolecule solution be embedded in a large molar excess of a 
photo-excitable 'matrix'. The application of laser light of the appropriate frequency 
results in the excitation of the matrix which in turn leads to rapid evaporation of the 
matrix along with its entrapped biomolecule. Proton transfer from the acidic matrix to the 
biomolecule gives rise to protonated forms of the biomolecule which can be detected by 
positive ion mass spectrometry, particularly by Time-Of-FUght (TOF) mass spectrometry. 
Negative ion mass spectrometry is also possible by MALDI TOF. This technique in5)arts 
a significant quantity of translational energy to ions, but tends not to induce excessive 
fragmentation despite this. Accelerating voltages can again be used to control 
fiiagmentation with this technique though. 
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Fast Atom Bombardment 

Fast Atom Bombardment (FAB) has come to describe a nurnber of techniques for 
vaporising and ionising relatively tnvolatile molecules. In these techniques a sanq)le is 
desorbed from a sur&ce by collision of the sample with a hi^ energy beam of xenon 
fltfwng or caesiimi ions. The san^le is coated onto a sur&ce with a sinQ)le matrix, 
typically a non volatile material, e.g. m-nitrobenzyl alcohol (NBA) or glycerol, FAB 
techniques are also con:5)atLhle with liquid phase inlet systems - the liquid eluting from a 
capillary electrophoresis inlet or a high pressure liquid chromatography system pass 
through a frit, essentially coating the surface of the frit with analyte solution which can be 
ionised from the frit surface by atom bombardment 

Mass Analysers 

Fragmentation of peptides by collision mduced dissociation is used in this invention to 
identify tags on proteins. Various mass analyser geometries may be used to fragcnent 
peptides aixd to. determine the mass of the fragments. 

MS/MS and MST analysis of peptides 

Tandem naass spectrometers allow ions with a pre-detennined mass-to-charge ratio to be 
selected and fragmented by collision induced dissodation (CID). The fragments can then 
be detected providing structural information about the selected ion. When peptides are 
analysed by QD in a tandem mass spectrometer, characteristic cleavage patterns aire 
observed, which allow the sequence of the peptide to be determined. Natural peptides 
typically fragment randomly at the amide bonds of the peptide backbone to give series of 
ions that are characteristic of the peptide. CID fragment series are denoted an, bn, Cn, etc. 
for cleavage at the n**^ peptide bond where the charge of the ion is retained on the N- 
terininal fragment of the ion. Similarly, fragment series are denoted x^, y^, Zq, etc. where 
the charge is retained on the C-tenninal fragment of tiie ion. 
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Trypsin and thronibin are fev oured cleavage agents for tandem mass spectrometry as ttey 
produce peptides with basic groups at both ends of the molecule, i;e. the alpha-amino 
group at the N-teraiinus and lysine or arginine side-chains at the C-temmms, This fevoinrs 
the formation of doubly charged ions, in which the charged centres are at opposite termini 
of the molecule. These doubly charged ions produce both C-temiinal and N-tenninal ion 
series ajfter CID. This assists in determiniTig the sequence of the peptide. Generally 
qjealdng only one or two of the possible ion series are observed in the CH) spectra of a 
given pieptide. In low-energy collisions typical of quadrupole based instruments the b- 
series of N-tenninal - fragments or the y-series of C-temainal fragments predominate. If 
doubly charged ions are analysed then both series are often detected. In general, the y- 
series ions predominate over the b-series. 

In general peptides fragment via a mechanism fliat involves protonation of the amide 
backbone follow by intramolecular nucleophilic attack leading to the formation of a 5- 
menibered oxazolone structure and cleavage of the amide linkage that was protonated 
(Schlosser A. and Lehmann W.D. J. Mass Spectrom. 35: 1382-1390, "Five-membered 
ring formation in unimolecular reactions of peptides: a key structural element controlling 
low-energy collision induced dissociation", 2000). Figure 16a shows one proposed 
mechanism by which this sort of fragmentation takes place. This mechanism requires a 
carbonyl group from an amide bond adjacent to a protonated amide on the N-terminal side 
of the protonated amide to carry out the nucleophilic attack. A charged oxazolonium ion 
gives rise to b-series ions, while proton transfer from the N-tenninal fragment to the C- 
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tenninal fragment gives rise to y-series ions as shown in figure 1 6a, This requirement for 
an appropriately located carbonyl group does not accoimt for cleavage at anride bonds 
adjacent to &e N-terminal amino acid, ^en the N-terminus is not protected and, in 
general, Vseries ions are not seen for the amide between the N-terminal and second 
aroino acid in a peptide. However, peptides with acetylated N-termini do meet the 
structural requirements of this mechanism and fragmentation can take place at the amide 
bond immediately after the first amino acid by this mechanism- Peptides with 
thioacetylated N-termini, as shown in figure 16c, will cleave particularly easily by the 
dxazolone noyBchanism as the sulphur atom is more nucleophilic than an oxygen atom in 
the same position. Fragmentation of the anride backbone of a peptide can also be 
modulated by methylation of the backbone. Methylation of an anride nitrogen in a 
peptide can promote fragmentation of the next anride bond C-terminal to the methylated 
anride and also favours the formation of b-ions. The enhanced fragmentation may be 
partly due to the electron donating effect of the methyl group increasing the 
nucleqphilicity of the carbonyl group of tiie methylated anride, while the enhanced 
formation of b-ions may be a result of the inability of the oxazolonium ion that fcwms to 
transfer protons to flie C-temrinal fragment as shown in figure 16b. In the context of this 
invention thioacetylation of the N-temrinus of a tag dipeptide can be used to enhance 
cleavage of the tag peptide at the next anride bond. Similarly, metixjdation of the lutrogen 
atom of an N-temainal acetyl or thioacetyl group will also enhance cleavage of the 
adjacent amide bond. Figures 17a and 17b illustrate pairs of tags that exploit these 
methods of enhancing cleavage at the marked amide linkage. 

The ease of fragmentation of the anride backbone of a polypeptide or peptide is also 
significantly modulated by the side chain fimctionalities of the peptide. Thus the 
sequence of a peptide determines where it will fragment most easily, hi general it is 
difficult to predict which anride bonds will fragment easily in a peptide sequence. This 
has iiqportant consequences for the design of the peptide mass tags of this inventioiL 
However, certain observations have been niade that allow peptide mass tags that fragment 
at the desired anride bond to be designed. Proline, for example, is known to promote 
firagmentation at its N-temodnal anride bond (Schwartz B.L., Bursey M.M., Biol. Mass 
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Spectrom. 21:92, 1997) as fiagmentation at the C-terminal anride gives rise to an 
energetically unfavourable strained bicycKc oxazolone structure. Aspartic add also 
promotes fragmentation at its N-terminal amide bond Asp-Pro linkages, however, are 
particularly labile in low energy CED analysis (Wysocld V.H. et aL, J Mass Spectronx 
35(12): 1399-1406, **Mobile and localized protons: a framework for unders t an d ing 
peptide dissociation." 2000) and in this situation aspartic acid seems to promote the 
cleavage of flie amide bond on its C-tenninal side. Thus prolme, and asp-pro linkages can 
also be used in the tag peptides of this invention to promote fragmentation at specified 
locations within a peptide. Figures 17c and 17d illustrate pairs of tags that exploit these 
methods of enhaindng cleavage at the marked amide linkage. Figure 17c illustrates a pair 
of tripeptide tags with the sequence alanine-proline-alanine. The proliue linkage 
promotes cleavage at its N-terminal amide. This is enhanced by the presence of a 
thioacetyl protecting group at the N-terarinus of the tripeptide and the cleavability is 
further enhtoced by methylation of the N-terminal nitrogen. The tags have the same mass 
but in the first tag tibiere is an alajoine residue with heavy isotopes in the third position of 
the tripeptide while in the second tag there is an alanine residue wilh heavy isotopes in the 
fibrst position of the tripeptide. Figure 17d illustrates a pair of tripeptide tags with the 
sequence aspartic acid-proline-alanine. The proline linkage promotes cleavage at its N- 
terminal amide. This is enhanced by the presence of the aspartic acid residue. The N- 
tenninus of the tripeptide is methylated to promote localised protonation here. The tags 
have the same mass but in the first tag there is an alanine residiie with heavy isotopes in 
tiie third position of the tripeptide while in the second tag there is an aspartic acid residue 
with heavy isotopes in the forst position of the tripeptide. 

A typical tandem mass spectrometer geometry is a triple quadnq)ole which conqnises two 
quadrupole mass analysers separated by a collision chaniber, also a quadrupole. This 
coDision quadrupole acts as an ion guide between the two mass analyser quadrupoles. A 
gas can be introduced into the collisian quadrupole to allow collision wilh the ion stream 
fixm the first mass analyser. The first mass analyser selects ions on the basis of their 
mass/charge ration which pass through the collision cell where they fragment The 
fragment ions are separated and detected in the third quadrupole. Induced cleavage can be 
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perfonned in geometries other than tandem analysers. Ion trap mass spectrometers can 
promote fragmentation throu^ introduction of a gas into the tr^ itself with which 
trapped ions will collide. Ion traps generally contain a bath gas, such as helium but 
addition of neon for sample, promotes fragmentation. Similarly photon induced 
fragmentation could be applied to trapped ions. Another fatvourable geometry is a 
Quadrupole/Orthogonal Time of Flight tandem instrument where the high scanning rate 
of a quadnq)ole is coupled to Ihe greater sensitivity of a reflectron TOF mass analyser to 
identify the products of fragmentation. 

Conventional 'sector' iiistruments are another common geometry used in tandem mass 
spectrometry. A sector mass analyser comprises two separate 'sectors*, an electric sector 
which focuses an ion beam leaviag a source into a stream of ions with the same kinetic 
energy using electric fields. The naagnetic sector separates the ions dn the basis of their 
mass to generate a spectrum at a detector. For tandem mass spectrometry a two sector 
mass analyser of this kind can be used where &e electric sector provide the first mass 
analyser stage, the magnetic sector provides the second mass analyser, with a collision 
cell placed between the two sectors. Two con^lete sector mass analysers separated by a 
collision cell can also be used for analysis of mass tagged peptides. 

Ion Traps 

Ion Trap mass analysers are related to the quadnipole mass analysers. The ion trap 
generally has a 3 electrode construction - a cylindrical electrode with 'cap' electrodes at 
each end forming a cavity. A sinusoidal radio frequency potential is applied to the 
cylindrical electrode while the cap electrodes are biased with DC or AC potentials. Ions 
injected into the cavity are constrained to a stable circular trajectory by the oscillating 
electric field of the cylindrical electrode. However, for a given anoplitude of the 
oscillating potential, certain ions will have an unstable trajectory and will be ejected from 
the trap. A san5)le of ions injected into the trap can be sequentially ejected from the trsq? 
according to Iheir noass/charge ratio by altering the osdlladng radio frequency potential. 
The gected ions can then be detected allowing a mass spectrum to be produced. 
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Ion tr^s are generally operated with a small quantity of a "bBXk gas', such as helium, 
present in the ion trap cavity. This increases bolh the resolution and the sensitivity of the 
device as the ions entering tiie trap are essentially cooled to the ambient ten5)erature of 
the bath gas through collision with the bath gas. Collisions both increase ionisation when 
a sairple is introduced into the trap and dsnspm the ainplitude and velocity of ion 
trajectories keeping fhem nearer the centre of the trap. This means that when the 
oscillating potential is changed, ions whose trajectories become unstable gain energy 
more rapidly, relative to the damped circulating ions and exit the trap in a tighter bunch 
giving a narrower larger peaks.. 

Ion traps can nmnic tandem mass spectrometer geometries, in feet they can mimic 
multiple mass spectrometer geometries allowing con5)lex analyses of trapped ions. A 
single mass sfpedes from a saniple can be retained in a trap, i.e. aU other species can be 
ejected and then the retained species can be carefully excited by super-in5)osing a second 
oscillating frequency on the first The excited ions will then collide with the bath gas and 
will fragment if sufficiently excited. The fragments can then be analysed ftrrther. It is 
possible to retain a fragment ion for further analysis by ejecting other ions and then 
exciting the fragment ion to fragment This process can be repeated for as long ais 
sufficimt san5)le exists to permit further analysis. It should be noted that these 
instruments generally retain a hig^i proportion of fragment ions aft^ induced 
fragmentation. These instruments and FTICR mass spectrometers (discussed below) 
represent a form of temporally resolved tandem mass spectrometry rather than q>atially 
resolved tandem mass spectrometry which is found in linear mass spectrometers. 

Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FTFCR MSy 
FnCR mass spectrometry has similar features to ion trq)s in that a sample of ions is 
retained within a cavity but m FTICR MS the ions are trapped in a high vacuum chamber 
by crossed electric and magnetic fields. The electric field is generated by a pair of plate 
electrodes that form two sides of a box. The box is contained in the field of a 
superconducting noagnet which in conjunction with the two plates, the trapping plates, 
constrain injected ions to a circular trajectory between flie trying plates, perpendicular 
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to the ^lied magaetic field. The ions are excited to larger orbits by applying a radio- 
frequency pulse to two "transmitter plates' which foim two further opposing sides of the 
box. The cycloidal motion of the ions generate corresponding electric fields in the 
remaining two opposmg sides of the box which conq>rise the "receiver plates'. The 
excitation pulses excite ions to larger orbits which decay as the coherent motions of the 
ions is lost through collisions. The corresponding signals detected by the receiver plates 
are converted to a mass spectrum by Fourier Transform (FT) analysis. 

For induced fragmentation experiments these instruments can perform iu a similar manner 
to an ion trap - all ions except a single species of iaterest can be ejected from the trap. A 
collision gas can be introduced into the trap and fragmentation can be induced The 
fragment ions can be subsequently analysed. Generally fragmentation products and bath 
gas combine to give poor resolution if analysed by FT analysis of signals detected by liie 
"receiver plates*, however the fragment ions can be ejected from the cavity and analysed in 
a tandem configuration with a quadrupole^ for example. 

Separation of labelled peptides by chromatography or electrophoresis 
In the optional second step of the second aspect of this invention, labelled biomolecules 
are subjected to a chromatographic separation prior to analysis by mass spectrometry. 
This is preferably High Performance Liquid Chromatography (HPLQ which can be 
coupled directly to a mass spectrometer for in-line analysis of the peptides as they elute 
from the chromatographic column. A variety of separation techniques may be performed 
by HPLC but reverse phase chromatography is a popular method for the separation of 
peptides prior to noass spectrometry. Capillary zone electrophoresis is another separation 
- method that may be coupled directly to a mass spectrometer for automatic analysis of 
eluting samples. These and other firactionation techniques may be applied to reduce the 
complexity of a mixture of biomolecules prior to analysis by mass spectrometry. 
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Applications of the invention 

Labelling peptides and polyp^tides and analysis byLC-A^S-MS 
In preferred embodimeats of the second aspect of this invention, the tags are used for the 
analysis of mixtures of peptides by liquid chromatography tandem mass spectrometry 
(LC-MS-MS). The use of tiie mass labels of this invention according to the second 
aspects will now be discussed in ^e context of the analysis of peptides. Peptide mass tags 
such as those in j&gures 1 and 2 may be used to label peptides. If the reactive functionalily 
on these con?)ounds is an N-hydroxysucdnimide ester then the tags will be reactive with 
free amino groups such as alpha-amino groups and epsilon amino groups in lysine. 

After attachment of the tags, the labelled peptides will have a mass that is shifted by the 
mass of the tag. The mass of the peptide may be sufficient to identify the source protein. 
In this case ojoly the tag needs to be detected which can be achieved by selected reaction 
monitcAing with, a triple quadrupole, discussed in noore detail below. Briefly, the first 
quadrupole of the triple quadrupole is set to let through ions whose naass-to-charge ratio 
corresponds to that of the peptide of interest, adjusted for the mass of the marker. The 
selected ions are then subjected to collision induced dissociation (CBD) in the second 
quadrupole. Under the sort of conditions used in the analysis of peptides the ions will 
fragment mostly at the amide bonds in the molecule. The markers in figures 1 and 2 have 
an amide bond, which releases the N-terminal portion of the tag on cleavage. Although 
the tags all have the same mass, the terminal portion is different because of differences in 
the substituents on either side of the amide bond. Thus the markers can be distinguished 
from each other. The presence of flie marker fragment associated with an ion of a specific 
mass should confirm that the ion was a peptide and the relative peak heights of flie tags 
from diflferent san^^les will give information about the relative quantities of the peptides 
in their sairples. If the mass is not sufficient to identify a peptide, either because a nmnber 
of terminal peptides in the san:5)le have the same terminal mass or because the peptide is 
not known, then sequence infijnnation may be determined by analysis of the con^lete 
CID spectrum. The pieptide fragmentation peaks can be used to identify the peptides 
while the mass tag peaks give information about the relative quantities of the peptides. 
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The analysis of proteiiis by tandem mass spectrometry, particularly mixtures of peptides, 
is coix?)licated by the 'noisiness* of the spectra obtained Peptides isolated from 
biological sanqjles are often contaminated with buflFering reagents, denaturants and 
detergents, all of which introduce peaks into the mass spectrum As a result, there are 
often more contamination peaks in the spectrum than peptide peaks and identifying peaks 
that coirespond to peptides is major problem, especially with small san5)les of proteins 
that are difficult to isolate. As a result various methods are used to determine which 
peaks correspond to peptides before detailed CID analysis is performed. Triple 
quadrupole based instruments permit *precursotr ion scanning' ( see Wilm M et al.. Anal 
Chem 68(3):527-33, *Tarent ion scans of unseparated peptide mixtures.'* (1996)). The 
triple quadrupole is operated in 'single reaction monitoring* mode, in which the first 
quadriq)ole scans over the full mass range and each gated ion is subjected to CCD in the 
second quadrupole. The third quadrupole is set to detect only one specific fragment ion, 
which is usually a characteristic fragment ion from a peptide such as immonium ions. 
The presence of phosphate groups can also be detected using this sort of technique. An 
alternative method used with quadrupole/time-of-flight mass spectrometers scans for 
doubly charged ions by identifying ions which when subjected to CID produce daughter 
ions with higher mass-to-charge ratios than the parent ion. A further method of 
identifying doubly charged ions is to look for sets of peaks in the spectrum which are only 
0.5 daltons apart with appropriate intrasity ratios which would indicate that the ions are 
the same differing only by the proportion of ^^C present in the n[iolecule. 

By labelling peptides with the mass labels of this invention, a novel form of precursor ion 
scanning may be. envisaged in which peptide peaks are identified by the presence of 
fragments corresponding to the niass labels of this invention after subjecting the labelled 
peptides to CID. In particular, the peptides isolated from each sainple by the methods of 
this invention may be labelled with more than one tag. An equimolar mixture of a 
'precursor ion scanning' tag \^ch is used in all saaq)les and a sample specific tag may be 
used to label the peptides in each sanq>le. In this way changes in the level of peptides in 
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different saBoples will not have an adverse effect on the identification of peptide peaks in 
a precursor ion scaii. 

Having identified and selected a peptide ion, it is subjected to CID. The CID spectra are 
often quite complex and determining which peaks in the CID spectrum correspond to 
meaningful peptide Augment series is a further problem in determining the sequence of a 
peptide by mass spectrometry. Shevchenko et al., Rapid Commun- Mass Spec. 11 : 1015- 
1024 (1997) describe a further method, which involves treating proteins for analysis with 
trypsin in 1:1 ^^O/^O water. The hydrolysis reaction results in two populations of 
peptides, the first whose tenninal carboxyl contains ^^0 and the second whose terminal 
carboxyl contains ^^O. Thus for each peptide in the sample there should be a double peak 
of equal intensity for each peptide where the double peak is 2 Daltons apart This is 
cdn5)licated slightly by intrinsic peptide isotope peaks but allows for automated scanning 
of flie CHD spectrum for doublets. The differences in mass between doublets can be 
determined to idmtify the amino add by the two fi-agments differ. This method may be 
applicable with the methods of this invention if N-termioal peptides are isolated. 

Protein Expression Profiling 

To understand the changes in a cancerous tissue, for exaiEple, requires an understanding 
of all of the molecular changes in that tissue, ideally relating these changes to normal 
tissue. To detranine all of the molecular changes requires the ability to measure changes 
in gene ejqnression, protein ejqpression and ultimately metabolite changes. It is possible to 
corDpare ttie ejqaression, between different tissue samples, of large numbers of genes 
simultaneously at the level of messenger RNA (mRNA) using microarray technology (see 
for example Iyer V.R. et al., Science 283(5398): 83-87, 'The transcriptional program in 
the response of human fibroblasts to serum." 1999), however mRNA levels do not 
correlate direcfly to the levels of protein in a tissue. To determine a protein expression 
profile for a tissue, 2-dimensional gel electrophoresis is widely used Unfortunately, this 
technique is extremely labcdous and it is difficult to coxnpBxe two or more sanoples 
simultaneously on a 2-D gel due to the difficulty of achieving reproducibility. As 
discuss^ above peptides may be analysed effectively using the methods of this invention. 
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The tags of this inveatioii allow the same pq)tide from different saiqples to be identified 
using LG-MS-MS. In addition, the relative quantities of the same peptide in different 
san5)les may be determined The ability to rapidly and sensitively determine the identity 
and relative quantities of peptides in a number of samples allows for e^qnression profiling. 
Therefore it is an object of this invention to provide in?)roved methods for comparative 
analysis of conqjlex protein saii5)les based on the selective isolation and labellmg of 
peptides. Two published ^roaches for the global analysis of protein expression are 
discussed and various methods for the analysis of particular protein states, such as 
phosphorylation and carbohydrate modification are also described below. 

Terminal peptide isolation for global protein expression profiling 
Isolation of N- or C-terininal peptides has been described as a metihiod to determine a 
global expression profile of a protein sample. Isolation of teraainal peptides ensures that at 
least one fflid only one peptide per protein is isolated thus ensuring that the complexity of 
the saii5)le that is analysed does not have more components than the original saniple. 
Reducing large polypeptides to shorter peptides makes the sample more amenable to 
analysis by mass spectrometry. Methods for isolating peptides from the temMni of 
polypeptides are discussed in PCT/GB98/00201, PCT/GB99/03258. 

Isolation of peptides containing cysteine 

As discussed earlier, Gygi et al. (Nature Biotechnology 17: 994 - 999 1999) disclose the 
use of 'isotope encoded aflSnity tags' for the capture of peptides from proteins, to allow 
protein expression analysis. The auttiors report that a large proportion of proteins (>90%) 
in yeast have at least one cysteine residue (on average there are -5 cysteine residues per 
protein). Reduction of disulphide bonds in a protein san^le and capping of free thiols 
with iodoacetamidylbiotin results in the labelling of aU cysteine residues. The labelled 
proteins are then digested, with trypsin for exanaple, and the cysteine-labeUed peptides 
may be isolated using avidinated beads. These captured peptides can then be analysed by 
liquid chromatogre5)hy tandem mass spectromyetry (LC-MS/MS) to detemme an 
e^qjression profile for the protein sancple. Two protein saix5>les can be corqpared by 
labelling the cysteine residues with a different isotopically modified biotin tag. This 
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approach is slightly more redundant than an approach based on isolating terminal peptides 
as, on average, more than one peptide per protein is isolated so there are more peptide 
species in the sample than protein species. This increase in con?)lexi1y is made worse by 
the nature of the tags used by Gygi et al. • 

As discussed above the afSnity tags described by Gygi et al have some disadvantages. 
Labelling each sample with a different isotope variant of the afSnity tag results in an 
additional peak in the mass spectrum for each peptide in each saiDple. This means that if 
two saiqples are analysed together there will be twice as many peaks in the spectrum. 
Similarly, if three samples are analysed together, the spectrum will be three times more 
con5)lex than for one sample alone. A furlher limitation, which is reported by the authors 
of the above paper, is the mobility change caused by the tags. The authors report that 
peptides labelled with a deuterated biotin tag elute slighfly after the same peptide labelled 
with an undeuterated tag. This means that comparative analysis of multiple saii5>les will 
be very difficult using the methods of Gygi et al. because of the comiplexity of the mass 
^ectra and the complexity of the chromatographic steps if mrare than 2 samples were 
analysed. 

An improved method for analysing protein samples by labelling cysteine residues is 
envisaged using tags of the form shown in Figure 8. This Figure illustrates a pair of 
in?)roved affinity tags derived from methionine. Different isotopically substituted forms 
of methionine would be used to prepare the two different tags. The total mass of each of 
the two tags is the same but the N-terminal methionine in each tag differs from the other 
tag by three Daltons. The alpha amino group of the dipeptide tag has been guanidinated 
to differentiate the fragmentation product of this amino acid from the fragmentation 
product of the second methionine residue and the natural methionine residues in protein 
and to promote protonation at this position in the tag during ionisation in a xdbss 
spectrometer. In addition these tags coii^irise a tMol reactive nMdeiinide fiinctioi^ 

In an enibodiment of the second aspect of this invention, a protocol for the analysis of a 
protein sairQ)le containing polypqptides with cysteine residues comprises the steps of: 
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L Reducing and reacting all cysteine residues in at least one protein sample with a 
maleindde affinity ligand mass tag; 

2, Qeaving the polypeptides with a sequence specific endoprotease; 

3. ' Capturing tagged peptides onto an avidin derivitised solid siqpport; and 
4 Analysing the captured tagged peptides by LC-MS-MS. 

The protein sanaples may be digested with the sequence specific endoprotease before or 
after reaction of the sample with the affinity ligand mass tag. 

Isolation of carbohydrate modified proteins 

Carbohydrates are often present as a post-translational modification of proteins. Various 
affmif y chroD:iatography techniques for llie isolation of these sorts of proteins are known 
(For a review see Gerard C, Methods Enzymol. 182: 529-539, 'Turification of 
glycoproteins." 1990). A variety of natural protein receptors for carbohydrates are 
known. The members of this class of receptors, known as lectins, are highly selective for 
particular carbohydrate fimctionalities. Affinity columns derivatised with specific lectins 
can be used to isolate proteins with particular carbohydrate noiodifications, whilst affinity 
columns comprising a variety of different lectins could be used to isolate populations of 
proteins with a variety of different carbohydrate modifications. la one eihbodiment of the 
second aspect of this invention, a protocol for the analysis of a sanocple of proteins, which 
contains carbohydrate modified proteins, comprises the steps of: 

1. Treating the san5>le with a sequence specific cleavage reagent such as Trypsin or 
Lys-C; 

2. Passing &e protein sample tbrough afBnity columns contain lectins or boronic add 
derivatives to isolate only carbohydrate modified peptides; 

3. Labelling the c£qptured sugar modified peptides at the free alpha amino groiq) 
generated by the sequence specific cleavage, using the peptide mass tags of this 
invention; and 

4- Analysing the tagged peptides by LC-MS-MS. 
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Ad N-hydroxysuccinimide activated tag coiald be used to label the free alpba-amino 
groirps. If Lys-C is iised thea each carbohydrate modified peptide will have a free 
epsilaQ-amino group as well as a free alpha amino group, both of which can be tagged. 

Many carbohydrates have vicinal-diol groups present, i.e. hydroxyl groups preseut on 
adjacent carbon atoms. Diol containing carbohydrates that contain vicinal diols in a 1,2- 
cis-diol configuration will react with boronic acid derivatives to form cyclic esters. This 
reaction is favoured at basic pH but is easily reversed at acid pH. Resin immobilised 
derivatives of phenyl boronic add have been used as ligands for ajBBnity capture of 
proteins with ds-diol containing carbohydrates. In one embodiment of the fourth aspect 
of this invention a set of afEinity ligand peptide mass tags comprising biotin linked to a 
phenylboronic acid entity could be synthesised, as shown in Figure fib. These boronic 
acid tags could used to label two separate 8anq)les coirprising peptides or proteins wilh 
carbohydrate modifications that contain vicinal ds-diols. In anotixer embodiment of the 
second aspect of this invention, a protocol for the analysis of a protein sanople containing 
carbohydrate modified polypeptides comprises the steps of: 

1 . Reacting at least one protein saorple at basic pH with a boronic acid affinity ligand 
mass tag, 

2. Cleaving the polypeptides with a sequence specific endoprotease, 

3. Capturing tagged peptides onto an avidin derivitised solid support and 

4. Analysing the captured tagged peptides by LC-MS-MS. 

The saniple may be digested with the sequence specific endoprotease before or after 
reaction of the sanq)le witii the affinity ligand mass tag. 

Vicinal-diols, in sialic adds for example, can also be converted into carbonyl groups by 
oxidative cleavage with periodate. Enzymatic oxidation of sugars containing terttnnal 
galactose or galactosamine with galactose OTddase can also convert hydroxyl groups in 
these sugars to carbonyl groups. Conq)lex carbohydrates can also be treated with 
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carbahydralte cleavage enzymes, siich as neuramidasey which selectively remove specific 
313^ modifications leaving behind sugars, which can be oxidised. These carbonyl groiq>s 
can be tagged allowing proteins bearing such modifications to be detected or isolated. 
Hydrazide reagents, such as Biocytin hydrazide CPierce & Waiiiner Ltd, Oiester, UK) 
will react with carbonyl groups in carbonyl-contaxning carbohydrate species (E.A. Bayer 
et al. , Anal. Biochein. 170: 271 - 281, 'TBiocytin hydrazide - a selective label for sialic 
adds, galactose, and other sugars in glycoconjugates using avidin biotin technology", 
1988). Altematively a carbonyl group can be tagged with an amine modified biotin, such 
as Biocytin and EZ-Link™ PEO-Biotin (Pierce & Warriner Ltd, Chester, UK), using 
reductive alkylation (Means G.E., Methods Enzymol 47: 469-478, •'Reductive alkylation 
of amino groups." 1977; Rayment L, Methods Enzymol 276:' 171-179, *TReductive 
alkylation of lysine residues to alter crystallization properties of proteins." 1997). 
Proteins bearing vicinal-diol containing caibohydrate modifications in a complex mixture 
can thus be biotinylated. Biotinylated, hence carbohydrate modified, proteins may then 
be isolated using an avidinated solid support 

A set of peptide mass tags according to this invention can be synthesised for the analysis 
of carbohydrate modified peptides that have been oxidised with periodate, as shown in 
Figure 6a. Figure 6a shows a set of two tags derived fix>m meihionine. Different 
isotopically substituted forrbs of methionine would be used to prepare the two difiPerent 
tags. The total mass of each of the two tags is the same but the N-terminal methionine in 
each tag diEfers firom the other tag by three Daltons. The alpha amino groiq) of the 
dipeptide tag has been guanidinated to differentiate the fragmentation product of fliis 
amino add from Ihe fragmentation product of the second methionine residue and to 
promote protonation at this position in the tag during ionisation in a mass spectrometer. 
A fiirther embodiment of the second aspect of this invention coicpdses the steps of: 

1. Treating a sample of polypeptides with periodate, so that carbohydrates with 
vicinal cis-diols oil glycopeptides will gain a carbonyl functionality; 

2. Labelling this carbonyl functionality with a hydrazide activated peptide mass tag 
linked to biotin, as shown in Figure 6 a; 
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3. Digesting fhe protein san^>l6 with a sequence specific endoprotease; 

4. Capturing tagged peptides onto an avidin derivitised solid siq)por1; and 

5. Analysing the biotmylated peptides by LC-MS-MS. 

The protein san5)le may be digested wilii the sequence specific endoprotease before or 
after reaction of the sample with Ihe afBnity ligand mass tag. 

Isolation ofPhosphopeptides 

iPhosphorylation is a ubiquitous reversible post-tramlational modification that appears in 
the naajority of signalling pathways of almost all organisms as phosphorylation is widely 
used as a transient signal to mediate changes in the state of individual proteins. It is an 
important area of researcb and tools which allow the analysis of the dynamics of 
phosphorylation are essential to a fiiU understandmg of bow cells responds to stimuli, 
which includes the responses of cells to drugs. 

Techniques for the analysis of pho^hoserine and phosphothreonine containing peptides 
are well known. One class of such methods is based on a well known reaction for beta- 
elimination of phosphates. This reaction results in phospboserine and phospbotbreonine 
forming dehydroalanine and methyldehydroalanine, both of which are Michael acceptors 
and will react with thiols. This has been used to introduce hydrophobic groins for 
aflSnity chromatography (See for example Holmes C.F., FEES Lett 215(1): 21-24, "A 
new method for the selective isolation of phosphoserine-containing peptides." 1987). 
Dithiol linkers have also been used to introduce fluorescein and biotin into phosphoserine 
and phosphothreonine containing peptides (Fadden P> Haystead TA, Anal Biochem 
225(1): 81-8, ^^Quantitative and selective fluorophore labelling of phosphoserine on 
peptides and proteins: characterization at the attomole level by capillary electrpphoresis 
and laser-induced fluorescence." 1995; Yoshida 0. et al.. Nature Biotech 19: 379 - 382, 
'"Bnrichment analysis of phosphorylated proteins as a tool for probing the 
phosphoproteome", 2001). The method of Yoshida et al. for afSnity enrichment of 
proteins phosphorylated at serine and threonine could be inproved by using the 
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maleimde tag showii in figure 8 to allow the .comparison of multiple san5)les. This 
would be particularly useful for the analysis of the dynamics of phosphorylation cascades. 

A tag peptide of the form shown in figure 7 would allow direct labelling of beta- 
eliiiiinated phosphothreonine and phosphoserine residues without a dithiol linker. The tag 
tetrapeptide of figure 7 is derived from methionine. Different isotopically substituted 
forms of methionine woxild be used to prepare the two different tags. The total miass of 
each of the two tags is the same but the N-teraiinal methLonine in each tag differs from the 
other tag by three Daltons. The alpha amino groiq> of the dipeptide tag has been 
guanidinated to differentiate the fragmentation product of tins anaino acid from the 
fragmentation product of the second methionine residue and natural methionine residues 
in proteins and to promote protonation at this position in the tag during ionisation in a 
mass spectrometer. The tag peptide is guanidinated at the N-Terminus to provide 
enhanced sensitivity and to distinguish the N-teraiinal residue from the C-terminal 
residue. The cysteine residue provides a free thiol, which can nucleophilically attack 
dehydroalanine and methyldehydroalanine. An inq>roved protocol for the beta- 
elimination based labelling procedure is known. This inq)roved procedure involves 
barium catalysis. (Byford M.F., Biochem J. 280: 261^261, **R^id and selective 
modification of phosphoserine residues catalysed by Ba2+ ions for their detection during 
peptide microsequenciag." 1991) This catalysis makes the reaction 20-fold fester reducing 
side-reactions to undetectable levels. The tag peptide shown in figure 7 could be easily 
coupled to dehydroalanine or methyldehydroalanine generated from beta-elimination of 
phosphates using barium catalysis. Thus in a fiirfher embodiment of the second aspect of 
this invention, peptides phosphorylated at serine and threonine may be analysed in a 
ixiethod conqxrising the steps of: . 

1. Treating a sanople of polypeptides witih barium hydroxide to beta-eliminate 
phosphate groups from phosphoserine and phosphothreonine; 

2. Labelling the resultant dehydroalanine or methyldehydroalanine functionalities 
with the thiol activated peptide mass tag linked to biotin, as shown in Figure 7; 

3. Digesting the protein sanople with a sequence specific endqprotease. 
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4. Cq)turing tagged peptides cmto an avidin derivitised solid support; and 

5. Analysing Hie biotinylated peptides by LC-MS-MS. 

The protein sanq)le noay be digested with the sequence specific endoprotease before or 
. after reaction of the san;)le with the afGnity ligand mass tag. 

A number of research groups have reported on the production of antibodies, which bind to 
phosphotyrosine residues in a wide variety of proteins, (see for examjple A.R. Frackelton 
et al., Methods Enzymol 201: 79-92, "Generation of monoclonal antibodies against 
phosphotyrosiiie and their use for affinity purification of phosphotyrosine-contaioing 
proteins.", 1991 and other articles in this issue of Melhods Enzymol). This means that a 
significant proportian of proteins that have been post-translationally modified by tyrosine 
phosphorylation may be isolated by afGnity chromatography using these antibodies as the 
afGnity column Hgand. 

These phosphotyrosine binding antibodies can be used in the context of this invention to 
isolate peptides firom proteins contaming phosphotyrosine residues. The tyrosine- 
phosphorylated protdns in a coii5)lex mixture may be isolated using anti-phosphotyrosine 
antibody afGnity columns. In a finther embodiment of the second aspect of this inveaition,- 
a protocol for the analysis of a sairple of proteins, which contains proteins 
phosphorylated at tyrosine, con5)rises the steps of: 

1. Treating the sample with a sequence specific cleavage reagent such as Trypsin or 
Lys-C; 

2. Passing the protein sample through afSnity colximns contain anti-phosphotyrosine 
antibodies to isolate only phosphotyrosine modified peptides; 

3. Labelling the captured phosphopeptides at the free alpha amino group generated 
by the sequence specific cleavage, using the peptide mass tags of this mvention; and 

4. Analysing file tagged peptides by LC-MS-MS. 
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An N-hydroxysuccmimide activated tag could be used to label the ftee alpha-amind 
groups. 

Immobilised Metal AfBnity diromatogrgphy QMAC) represeats a further technique for 
the isolation of phosphoproteins and phosphopeptides. Phosphates adhwe to xesins 
coiEprising trivalent metal ions particularly to Gal]ium(III) ions (Posewitch, MC. and 
Tempst, P., Anal, Chem., 71: 2883-2892, "Immobilized GalHum (HI) AJOanity 
Chromatography of Phosphopeptides", 1999). This technique is advantageous as it can 
isolate both serme/threonine phosphorylated and tyrosine phosphorylated peptides and 
proteins simultaneously 

IMAC can therefore also be used in the context of this invention for the analysis of 
samples of phosphorylated proteins. In a further embodiment of the second aspect of this 
invention, a protocol for the analysis of a sanq)le of proteins, which contains 
phosphorylated proteins, conoprises the steps of: 

1. Treating the sample with a sequence specific cleavage reagent such as Trypsin or 
Lys-C; 

2. Passing the protein sanq)le throu^ an affinity column conpdsing immobilised 
metal ions to isolate only phosphorylated peptides; 

3. Labelling the captured phosphopeptides at the free alpha amino group generated 
by the sequence specific cleavage, using the peptide mass tags of this invention; 
and 

4. Analysing the tagged peptides by LC-MS-MS. 

An N-hydroxysuccinimide activated tag could be used to label the firee alpha^amino 
groins. 

In an alternative embddintient of the second aspect of this invention, a san^le of 
phosphorylated proteins may be analysed by isolating phosphorylated proteins followed 
by analysis of tixe N or C terminal peptides of ih& phosphoproteins. Techniques for the 



wo 03/025576 



55 



PCT/GB02/04240 



isolation of termmal peptides are disclosed in a ninnbelr of patent applications, e.g. 
W098/32876, WO 00/20870 and BP 01304975.4. A protocol for the analysis of a sample 
of proteins, which contains phosphoryiated proteins, would comprise the steps of: 

1. Passing the protein sample through an afBnity column con^prising immobilised 
metal ions to isolate only phosphoiylated proteins; 

2. Isolating C and/or N tOTninal peptides froicn tiie captured phosphoryiated protems; 

3. Labelling the captured terminal peptides, using the peptide mass tags of this 
invention; and 

4. Analysing the tagged peptides by LC-MS-MS. 
Examples 

Example 1 - Syntheses ofX-Met^-Met-Gly-OHCA) and ofX-MeUMetd^-Gly-OH (B) 

A pair of peptides were synthesised using conventional automated synthesis tectmiques to 
illustrate the features of this invention (both starting from commercially available Fmoc- 
Gly-Trt-PS resin from Rapp Polymere, Geimany). The two peptides A and B are shown 
in Figure 10 and will be referred to as the two Met-Met-Gly 033) peptides. 

Deuterated methionine (Metd^) is available from ISOTEC Inc, Miamisburg, Ohio, USA. 
The Fmoc reagent for use in a peptide synthesiser must, however, be synthesised 
manually from the xmprotected deuterated methionine. 

Synthesis ofN-(9'Miiorenylmethoxycarbonyl)'I^methionine-n^^ (Fmoc-Met(f) 

The synthesis of Fmoc-Metd^ (shown in Figure 9a) was carried out in two steps. 

i. Synthesis of P-Flnorenylmethyl-pente^uorphenyl carbonate 

8.4mL (60mmol) triethylamine were added at O^C to a mixture of llg (60mmol) 

Pentafluorophenol and 15,5g (6Qirmiol) chloroformic add (9-fluorenylmethyl) ester in 
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lOOmL dry ether. After 2 hoiJis reaction, 20 mL cx)ld water was poured to the solution. 
The organic layer was washed twice with water, dried After ev^oration of the solvent, 
the obtained product was crystallized from heptane. Yield: 16,4g (67%) 

2. Synthesis of N'(9'Fliiormylmeihoxycarbonyl)'L'met^^ 

2,2g (14,5iinnol) I^nGbetMonine-meth)i-d3 (Metd^) was suspended in SOioL acetone. 2.Sg 
(29nmol) sodium hydrogencarbonate and 60 mL water and then 5,7g (14mmol) 9- 
Fluorenylmefhyl-pentafluoiphenyl carbonate were added to the stirred suspension. After 
48 hotirs reaction, the pH of the clear solution was altered to pH3 and the organic layer 
was extracted by ethylacetate. After drying the extracted organic layer, the ethylacetate 
was evaporated and the product was precipitated by addition of heptane. That procedure 
(dilution with ethylacetate and precipitation by hexane) was repeated twice before 
obtaining the pure product, Fmoc-Metd^ (Yield: 5,0g (92%); Fp: 126-128^0; Wd - 
30°, c=l, DMF) 

The reaction sequences of the peptide synthesiser fOT the preparation of the two peptides 
shown m Figure 1 0 are listed below. 

Peptide sequence (A) 

• Swelling of 50mg of Fmoc-Gly-Trt-PS resin for 5min in 2nil of 
dimethylformamide (DMF); 

• Removal of the Fmoc group with piperidine in DMF following standard protocols; 

• Dissolving of 49mg (0.32nimol) of l-hydroxybenzotriazole (HOBt) in 800fil 
DMF; 

• Addition of 120mg (0.32minol) Fmoc-Met to the HOBt solution; this solution was 
added to the resin and incubated for 3xmn; 

• 50^il (0.32inmol) of diisopropjdcarbodiimide (DIG) was then added; coupling time 
(0.4M activated amino acid) 

• Removal of the Fmoc group with piperidine in DMF following standard protocols. 

• Dissolving of 49mg (0.32mmol) HOBt in SOO^il DMF; 
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• Adding to 12Qmg (0.32iiimol) Fmoo-Metd^; this solution was added to the resin 
and incubated for Smin; 

• SO|xl (0.32inmol) DIG was ttien added; coupling time (0.4M activated andno acid) 

• Removal of the Fmoc group with piperidine in DMF following standard protocols; 

• 1 50 mg (0.32imnol) ,3oc2X-0Su" were dissolved in 800^1 DMF and this solution 
was added to the resin; 

• 53p.l of Diisopropylethylamine (DffBA) were then added to the iresin, and the 
coupling was left to proceed for 3 hours (0.4M activated species); 

• After wsLshing the resin the desired substance was cleaved from the resin with linl 
TFA containing 2,5% H2O, EtaSiH and thioanisole each within Ih; 

• Adding of 3Qml water to TFA solution after jSltration, removing of all solvent by 
lyophilisation. 

A white powder of peptide sequence (A) resulted 

Peptide sequence (S) 

• Swelling of 50mg resin 5min in 2ml DMF; 

• Remioval of the Fmoc groiq) with piperidine in DMF following standard protocols; 

• Dissolving of 49mg (032mmDl) HOBt in SOOjil DMF; 

• Adding to 12Qmg (0.32nQn3ol) Fmoc-Metd^; this solution was added to the resin 
and incubated for 3min; 

• SO^il (0.32mmol) DIG was added; coupling time (0.4M activated amino acid) 

• Removal of the Fmoc group with piperidine in DMF following standard protocols; 

• Dissolving of 49mg (0.32mmol) HOBt in SOOjil DMF; 

• Adding to 12()mg (0.32namol) Fmoc-Met; this solution was added to the resin and 
incubated for Smin; 

• 50^1 (0.32mmol) DIG was added; coiipling time (0.4M activated amino add) 
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• Removal of the Fmoc group with piperidine in DMF following standard protocols; 

• 150 mg (0.32nmiol) ,3oc2X^OSu" were dissolved in 800^1 DMF and ttds solution 
was added to theresin^ 

• 53^1 DffEA were added, 3h coupling time (0.4M activated species) 

• After washing liie resin the desired substance was cleaved from the resin with Iml 
TFA containing 2.5% H2O, EtsSiH and thioanisole each within Ih; 

• Adding of SOnil water to TFA solution after filtration, and removal of all solvent 
by lyophilisation. 

A ligjit yellow powder of peptide sequence (B) resulted. 

HPLC 

After cleavagie, ca. 80% pure product was obtained for each peptide. The products were 
then puiified by HPLC. . . 

MS 

The identity of the peptides A and B was confirmed by mass spectrometry. A mass-to- 
charge ratio of 496 was observed as the main peak in both MALDI and ESI mass spectra 
for both products, which fits the calculated mass of both peptides. A mass spectrum from 
the analysis of a mixture of peptides A and B by ESI mass spectrometry is shown in 
Figure 11. It can be sem that the two peptides have mass spectra that overlap almost 
exactiy, which is as expected. 

MS/MS 

Figure 13 shows the proposed firagmentation reaction mechanism for the products of 
collision induced dissociation of the n:iodel peptides A and B shown in figure 10. Figure 
12 shows a pair of ESI MS/MS spectra generated by an LCQ ion trap mass spectrometer 
firon Finnigan MAT. The ESI MS/MS spectra show the fragmentation products of 
peptides A and B. The desfred b2-fi:agment ion (see Figure 10) has a high intensity for 
both substances (273 after loss of ammonia for A and 270 after loss of anmionia for B). 
Figure 14 shows and ESI-MS/MS spectrum of fihte firagmentation products from the 
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analysisof aiidxtureofpeptLdes Aand A and Bw€to present mlheidxtureia a ratio 
of 70:30 respectively. This ratio can be seen in liie intensities of fte b2-fragment ion 
peaks at no/z 273 and 270 for peptides A and B respectively. This spectrum shows that 
the tags can reveal the ratio of their associated peptides when pairs of samples are 
conopared. Figure 15 shows a Unear regression curve for a series of ESI-MS/NdS 
experiments with peptides A and B. The graph shows a plot of the ratio of A to B in the 
mixture against the ohsaved intensities of the b2-jQ:agment ions from BSI-MS/MS 
analysis of the mixtures. The graph shows that there is a good correspondence between 
the expected and observed ratios. 

Example 2 - d-lBisftert-btdtyl-oxycarboTiyl) guanidinoj-hexanoic acid-N- 
kydroxysuccinimidester 

The synthesis of the guamdino-active ester linker shown in Figure 9b was carried out in 3 
stages shown below. 

L Synthesis of cmtinchiminomethane sidphonic acid 

50mL acetic anhydride and 2 drops of cone, sulphuric add were added to 45g (397n]mol) 
30% aqueous hydrogen peroxide under ice cooling. After 30 minutes, lOOmL 
(1157mmol) acetic anhydride was added to tiie solution at 10-12^C once again The 
reaction mixture was stirred ovemight and reached the room temperature in that time. 
After adding 150 mL methanol, the solution made from lOg (131mmol) thiourea in 
500mL methanol was dropped slowly into the reaction at 15-20°C. The reaction was 
stirred at RT for 48 hours. After filtration, the solution was condensed to 60 mL. The 
obtained product was filtered and washed with ethanol and purified by crystallisation 
from acetic acid (ca. IL). Yield: 6,0g (37%). 

2. Synthesis of 6-Gtumidinohexanoic acid 

6.5g (50mmol) 6-aminohexahoic acid and 6.9g (50mmol) sodium carbonate were 
dissolved in 50mL water. 6.2g (5Qmmol) amino-iminomethane sulphonic add was added 
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tmcto stttring to the solutidn. After 20 hours, the product was filtered and washed with 
acetic acid, methanol and then ether. Yield: 6.6g (76%). 

5.. Synthesis of6'[Bis (tert-bidtyl-oxycarbonyl) guanidinoj-hexanoic add-N-hydroxy 
succinimide ester 

9.5g (SSmmol) 6-Guanidinohexanoic add and 55g (270nmol) N,0- 
Bistrimethylsilylacetamide were stirred in lOOmL dichloromefhane and heated under 
refluxing until a clear solution was obtained (the reaction was left for approximately 10 
hours). 46g (210mmol) Di-tert-hxityl pyrocarbonate was added to the solution at RT and 
the reaction mixftire was heated under refluxing for 3 hours after having been stirred at 
RT for 18 hours (ovemi^t). The solution was then cooled to RT and washed with a 10% 
citric acid solution and a sodium chloride solution. After evaporation of the solvent, the 
pyrocarbonate was distilled at 80-90°C under vacuum. The viscous liquid obtained (30g) 
was dissolved in 100ml dichloromethane with 8,6g (75mmol) N-Hydroxysuccininride. 
15,Sg (75mmol) dicyclohexylcarbodiimide (DCC) was added in portions to the reaction 
mixture with stirring at RT. After 17 hours, the urea was removed by filtration. The 
solution was washed with a 10% citric add solution and after removing the solvent, the 
product was purified by chromatography (silica gel, solvent: 
dichloromethane/ethylacetate). The product was then crystallized firom diisopropylether. 
Yield- 6,0g (19%). Rf: 0,77 (dichloromefhane/ethylacetate : 3/1). Fp: 108-109°C. 

Example 3 

Experimental protocols 

Two pairs of TMT reagents are shown in Figures 1 8a and 18b. The reagents are peptide 
tags according to this invention comprising one 'tag* amino acid linked to a sensitisation 
group ([1] , [2], [3]), which is a guanidino-functionality, one *mass normalisation' amino 
acid and in the second pair of tags, a cleavage enhancement group, which is proline in this 
case ([4]). These tags are designed so that on analysis by collision-induced dissociation 
(QD), the tag fragment is released to give rise to an ion with a specific mass-to-charge 
ratio. The currant accepted model of peptide scission during CID requires protonation of 
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the peptide backbone followed by nucleophilic attack of the carbonyl moiety of the 
protonated annde by the next N-tetminal carbonyl residue in the peptide chain to form a 
relatively stable oxazolone leading to scission of the amide bond ([5]). The sensitisation 
enhance is linked to the N-temrboal methionine residue by an ainide bond but cleavage 
does not take place at this amide as there is no amide correctly positioned to allow 
cyclisation and cleavage at this position so cleavage can only take place between the two 
methionine residues. This means that the N-terminal methionine is distinguished j&om the 
second methionine by the mass of the guanidino sensitisation groiq). Thus each pair of 
tags allows a pair of peptides to be distinguished by MS/MS analysis. Each tag can also 
bear a reactive functionality. In the figure, the reactive functionality, jR, is not specified 
but could be an N-hydroxysuccioimide ester, which allows for the specific labelling of 
ainino-groups. Qearly this reactive functionality can be easily varied to allow different 
biological nucleophiles to be labelled In addition, the tag design can be readily modified 
to accommodate an afBnity ligand such as biotin. Furthermore, it should be clear that 
more than two tags can be generated allowing for conq>arison of additional san9>les or for 
the introduction of labelled standards. 

In the following examples, peptides, listed m Table 7, have been synthesised as if they 
have been completely labelled on the alpha amino groiip with the above tags, i.e. the tag 
was *p:e-incorporated' during the synthesis to test the performance of the tags 
ittdependentiy of the labelling reactions, so that in flie following examples the *R' group 
shown in Figure 18a and 18b is the peptide sequence to which the tag is attached. The 
tagged peptides were analysed by ESI-MS/MS and LC-ESI-MS/MS. 

Figures 18a and 1 8b show the structures of two versions of the TMT markers. The tags 
are modular comprising different functional con^onents that correspond to individual 
synthetic compon^ts hi the automated synthesis of these reagents. Each tag con^prises a 
s^itisation grottp and a mass differentiated group that together comprise the 'tag 
fragment' that is actually detected The tag fiagment is linked to a mass normalisation 
group that ensures that each tag in a pair of tags share the same overall mass and atomic 
composition. The first and second generation tags are distinguished by the presence of an 
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additional fiagmeatation enhancmg gcoup, proline, in the second generation tag. The tags 
will additionalty conoprise a reactive functionality (R) to enable the tag to be coupled to 
any peptide but in the present experiments, R is one of a nuniber of peptide sequences. 
The proposed tag firagmmt that results from the markers is sliowh in Figure 18c based on 
current theories on backbone protonation dependent mechanisms of fragmentation ([S]). 

Syntheses of TMT labelled peptides 

The peptides shown in Table 7 were synthesised using conventional automated Fmoc 
synthesis techniques (both starting from commercially available Fmoc-Gly-Trt-PS resin 
from Rapp Polymere, Geiinany). Deuterated methionine (Metd^) is available from 
ISOTBC lhc,-Miamisburg, Ohio, USA. An Fmoc-Metd^ reagent fox use in a peptide 
synthesiser was synthesised manually from the unjprotected deuterated methionine as 
described above. The guanidino 'sensitisaition' enhancement group was synthesized as an 
N-hydroxysuccinimide ester (NHS-ester) as described above and added to deprotected 
alpha-amino groups of synthetic peptides by conventional methods during automated 
peptide synthesis. 
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Table? 



Peptide sequences 


Generation 1 


Generation 2 




Mi 


Ion at m/z 
(z) 




Ion at m/z 


lA 


TMT-GVATVSLPR 


1319.7 


660.9 (2+) 


1415.7 


708.9 (2+) 


IB 


TMT-GVATVSLPR 


1319.7 


660.9 (2+) 


1415.7 


708.9 (2+) 


2A 


TMT- 

GLGBHNIDVLEGNEOnNAAK 


2688.31 


897.1 (3+) 


2784.3 


928.8 (3+) 


2B 


TMT- 

GLGEHNEDVLEGNEQFINAAK 


2688.31 


897.1 (3+) 


2784.3 


928.8 (3+) 


3A 


TMT-GNKPGVYTK 


1383.7 


462.2(3+) 


1479.7 


494.3 (3+) 


3B 


TMT-GNKPGVYTK 


1383.7 


462.2 (3+) 


1479.7 


494.3 (3+) 


4A 


TMT- 

GDPAALJORAKNTEAARRSRAR 
KLQRlklKQGGC 


3874.6 


969.7 (4+) 


3970.6 


993.7 (4+) 


4B 


TMT- 

GDPAALKRAKNTEAARRSRAR 
KLQKMKQGGC 


3874.6 


969.7 (4+) 


3970.6 


993.7 (4+) 



Table 7: Abundance, ratio experiiniBnts were performed with the peptides listed above. 
HPLC experiments were performed with the first three peptide sequences listed above. 
Pairs of synthetic peptides were prepared with either the first or second TMT pre- 
incoiporated into the peptide sequence at the N-tenrdnus. Sequences, mono-isotopic 
molecular inasls and mass-to-charge ratios of predominant ion species are listed for eadi 
tag. 

MS/MS analysis ofTMT4abelled peptides 

Aoalyses were performed by liquid chromatography mass spectrometry using either a 
Finnigan LCQ Deca wiHi a Finnigan Surveyor HPLC System (Column: 50 x 2.1mm, 
5|im HyPURITY™ Elite C18) or a QTOF 2 firom Micromass Ltd, Manchester, UK with a 
Cap-LC HPLC system firom USAP Technologies (Column: PepMap C18 HPLC colmm 
firom Dionex with a 7S^m iimer diameter was used; the resin had a 3|xm particle size, 
lOOApore size). 
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Ion abundance ratios were determined by sunnnation and averaging of a nmnber of 
spectra of an elnting peptide pair followed by determination of the ratios of flie peak 
intensities for the tag firagments. 

Example 3a - Comparison of and generation TMT tags 

To demonstrate the advantages of a tag desigaed with a fragmmtation enhancing group 
two different TMT designs were explored The tags differ by the inclusion of proline in 
the generation tags (Figure 18a and 18b). Proline is known to rahance cleavage of the 
amide bond on its N-^terminal side ([4]), 

Initial experiments on the firagmentation of the 1^^ generation of TMT in a Micrbmass 
QTOF 2 instrunaent showed that the intensity of the TMT fragments was very dependant 
cm the amino acid sequence of the peptide and at low collision energies the tag fragments 
did not accurately reflect the abundances of the tagged peptides. As shown in figure Ic 
the expected tag fragments have an m/z of 287 or 290 but, in the first generation tags, a 
second pair of ions with mass-to-chaxge ratios of 270 or 273 is observed. These 
fragments, are thought to result from the loss of ammonia from the expected tag 
fragmaits. An example of a typical C3D spectrum for a peptide labelled with the first 
generation tags is shown in Figure 19. At lower collision energies the intensities of these 
two fragment classes varied with the sequence of the attached peptide but at higjier CID 
energies the 270/273 fragments are observed almost exclusively. At these higher collision 
energies, the 270/273 tag fragments did accurately reflect the abundances of the peptide 
pairs. Additional e^^^eriments using a Finnigan LCQ ion trap mass spectrometer have 
shown the same fragmentation pattern as the QTOF for the first generation TMT units. 
The observed ammonia loss occurs ia both LCQ and QTOF experiments. These 
instruments differ in the manner in which CID is carried out (selective activation and 
fragmentation of only tiie parent ion in the LCQ versus serial fragmentation of all ions in 
QTOF). Since the loss of NH3 takes place in both instruments, this suggests that the loss 
of NH3 may take place directly from the parent peptide ion, rather than as a result of 
subsequent collisions of the expected fi:agment ion and is an intrinsic feature of this tag 
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Structure. la both instnnnents the ^pearance of the 270/273 ftagment is fevoured by 
higher collision eoaergies. This meant that to get consistent behaviour from this tag 
analysis had to take place at high collision energies. 

Although CID is more selective in the LCQ, it is unfortunately limited in its use with 
TMTs as it is not possible to detect small CID fragmentation products of larger precursors 
with ttiis type of instrummt In the QTOF instrument, however, at the higher energies of 
collision, corisecutive fraigmentations were problematic, hi the Q-TOF, the series of b- or 
y-ion fragments that provide sequence information are further fragmented to give smaller 
species so that no sequence information could be obtained from the peptide. As a result 
of the need for high energy CTD to guarantee tiie release of the tag fragments and to 
obtain accuriate quantification, the first generation TMT units can only be reliably used for 
the purposes of quantification without peptide identification in the QTOF. This will also 
be true of other serial MS/MS instruments. 

Figure 19a and 19b show typical CID spectra for a peptide labelled with the first 
generation TMT at collision energies of 40V (Figure 19a) and 70V (Figure 19b). In 19a 
weak peaks in both of the 270/273 and 287/290 regions can be seen at 40V, but they do 
not accurately represent the abundances of the tagged peptides. Some sequence specific 
y-series ions can be observed though at tiiis accelerating potential. In 19b the peaks 
corresponding to the tag fragment can be seen clearly at miz 270 and 273 for the fu^ 
generation TMT at a collision energy of 70V. At this collision energy the intensities of 
these peaks accurately represent the relative abundances of each peptide (see inset for 
zoom of the tag region in Figure 19b) but no sequence data can be determined. 

These results lead to the development of a generation TMT, which has a proline 
residue in the TMT unit to enhance the fragmentation. To quantify the effect of the 
proline in the second generation tags a 50:50 mixture of a peptide labelled with the first 
and second generation tags respectively was analysed by MS/MS. The two resultant 
peptides, with the sequences Guanidinocaproyl-Met(D3)-Met- 
GLGBHNIDVLEGNEQFINAAK and Guamdinocaproyl-Met(D3)-Pro-Met- 
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GLGEHlOTVI^GajffiQHNAAK, had ions correspondiiig to the IM+3H]^ spedes at 
mass-to-charge ratios of qyproxiiDately 897 and 929 for the first and second generation 
tags respectively. To get the same collision conditions for both precursors, the peptides 
were first mixed and then analysed in a QTOF instrument with the quadrupole set to 
alternately select ions with m/z around 897 or 929. Each selected ion was subjected to 
CID at increasing collision energies. 

At collision energies of 20V or less no fragmentation at all was observed for either type of 
TMT. At a collision energy of 30V-35V it is possible to see the e^qpected TMT fragment 
ions at m/z of 290 in the CID spectrum for the peptide with the second gmeration tag but 
no firagment ions m/z of 273 can be seen in the spectrum for the peptide with the first 
generation tag at the same energy, see Figure 20, alftiough a weak firagment at mix of 290 
can be seen. The tag fragment for the peptide containing the first generation TMT is not 
observed until a collision energy of 70V is used (data not shown). Smaller peptides 
labelled with the first generation IMT gave rise to the tag firagment at lower energies but 
high collision energies were required to release the tag firagment firom larger peptides. 
The size dependence of the peptide on the energy needed to release the tag fragment was 
much smaller for the second generation TMT. Coniparison of the CID spectra fix)m 
peptides labelled with TMTs containing proline wiih peptides labelled with TMTs 
without prolihe shows clearly that the introduction of the proline amino acid as a 
fragmentation enhancer leads to fragmentation in favour of the expected TMT tag 
firagment without resorting to very hi^ collision energies. At these lower energies the 
abundance ratios also of the TMT firagment ions, from the proline containing TMTs, 
accurately reflect the ratios of the concentrations of fhe tagged peptides. In addition, the 
identification of the peptide via its b and y series can also be performed at these lower 
collision energies. 

Figure 20a 20b and 20c show MS and MS/MS spectra for triply charged ions of the 
peptide 2 (see Table 7) labelled with the first and second generation TMTs. The peptides 
were analysed in a QTOF II instrument. Figure 20a shows the MS-mode TOF spectrum 
of the peptide mixture. For CID analysis the first quadrupole was set to alternately select 



wo 03/025576 



PCT/GB02/04240 



67 

ions with miz around 897 or 929. The C3D spectrum at 35V for Guanidmocaproyl- 
Met(D3>Met.GLGEHNroVOGl^QFINAAK is shown in Figure 20b and tie CED 
spectrum at 35V of Guanidinoc£5)royl-Met(D3)-Pro-Met- 

GLGEHNIDVLEGNEQFINAAK is shown in Figure 20c. The presence of the expected 
tag fragment at m/z of 273 is not detected for the first generation TMT in Figure 20b but 
the expected fragment at 290 is clearly observed at 35V for the second generation TMT in 
Figure 20c. 

The improved behaviour of the second generation TMT can be seen in Figure 21 which 
shows a typical CID spectrum of a peptide labelled with these tags. The tag firagments 
revealing the abundance ratios are easily seen at the expected m/z values of 287 and 290. 
In addition it is possible to see both b-series and y-series ions allowing the sequence of 
the peptide to be determined. CID was performed at a relatively low collision energy of 
40V. The peaks at m/z 287 and 290 for the second generation TMT at 40V represent the 
relative abundances of each peptide (see inset with zoom of the relevant region of the 
mass spectrum). 

Mgure 22 clearly shows that the charge state of the TMT tagged peptide does not affect 
the appearance of the tag fragments in the CID spectra of the labelled peptides. In this 
example a peptide labelled with a first generation TMT is shown but the same result is 
found for the second generation tags. This is advantageoiis as it means that scanning of 
the spectrum can take place without conoplex adjustments of the scanning software to 
compensate for the charge state of each peptide. In other isotope tagging procedures, 
such as ICAT, the charge state alters the naass difference between each tagged ion pair, 
sucb that for doubly charged ions the mass difference is halved, for triply charged ions the 
mass difference is a third of that for the singly charged ions, etc. Software to scan for 
peptide pairs using conventional isotope labelling techniques, like ICAT, naust therefore 
compensate for these sorts of problems by allowing for the different possible mass 
differences or by ignoring certain classes of ion, which either incnreases the chance of 
erroneous identification of peptide pairs or misses out on potential ion pairs that could 
offer useful information. 
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In Figure 22, comparison of spectra for peptide 4 from Table 7 where CID has been 
performed on the [M:+4H]4+ (bottom spectrum) and IMH-5H]5+ (top spectrum) species. 
The pejptide above contains the first generation TMT. The 4+ ion has an m/z of 969.3 
while the 5+ ion has an m/z of 775.6. The tag firagment ion appears at the expected naass- 
to-charge ratio of 273 in both spectra indicating that only one charge localises to the tag 
fragment 

Figure 23 shows data for expected and observed ratios of peptides from ESI-MS/MS 
aiialyses of the 4 peptides listed in Table 7. Peptides with both first and second 
generation TMIs incorporated into them were analysed. Abundance ratios were 
determinedby analysing the peak maxima at the d3 (A) and do (B) ofthe tag fragment ion 
peaks afla: peak normalization at 290 and 287 for TMT2. Measurements were made in a 
QTOF instrument The table inset to Figure 23 shows q3cpected and observed ratios the 
b-ion fragments from the MS/MS analysis of eluting TMT labelled peptides. It can be 
seen that both generations of TMT provide accurate representation of abundance ratios of 
the peptides in the n[iixtures and that the tags show linear behaviom: over the entire range 
of peptide ratios tested. 

Example 3b - Demonstration of identical chromatographic behaviour of TMT tags in 
LC-MS 

A noixture of four pairs of synthetic peptides were synthesised with the second generation 
TMT units pre-incoiporated at the N-terininus of each peptide. The peptide pairs were all 
analysed together. Each peptide pair was prepared at a different ratio. The sequences, 
theoretical mono-isotopic masses, the doubly charged ion masses are shown in Table 7. 
The peptides were loaded onto a C-18 reverse phase HPLC column and separated. The 
purpose of this experiment was to demonstrate the exact co-elution of con:eq)Qnding pairs 
of peptides with different TMT tags without any other con5)lications. The ratios of the 
peptide pairs were expected and found to be consistent over the entire elution time for 
each peptide pair and so a further object of this experiment was to show that 
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quantification of tiie peptide pairs could be perfonned with sinsultaneous sequence 
detemdnatiQn and that it would be possible to scan for other peptides without waitiag for 
the complete elution of the peptide. Complete ehition of peptide pairs is necessary for 
accurate quantification using the ICAT strategy and other peptide analysis techniques 
using conventional isotope labelling. Tliis greatly restricts liie throughput of these 
approaches. 

Figure 24 shows the co-elution of each peptide pair, peptides A and B for each peptide 
frona Table 7, clearly seen in the CIS-reverse phase BDPLC traces. For each peptide the 
ion currents at m/z 287 and 290 are recorded corresponding to the tag firagments fi*om 
each of the TMTs. The bottom trace for each peptide is the total ion current The elution 
profiles of 3 peptides monitored at each of the mass-to-charge ratios of the hz ions from 
the tag fragments are shown. It can be clearly seen that the peptide pairs elide as a single 
fraction. In MS/MS mode, monitoring of the tag fragment ions produces virtually 
identical results in each case. For each peptide pair the observed ratios matched the 
expected ratios to a reasonable degree. 

Since the tagged peptides exacfly co-elute, the ratios of the peptide pairs are conserved 
throughout the elution profile, which means that it is not necessary to integrate the total 
ion current for the eluting ions to determine the relative abundance of each peptide pair. 

Example 3 c - Analysis of the sensitivity and robustness of the IMF technology 

To provide an efifective in5)rovement over conventional isotope labelling, the TMT 
technology must be at least as sensitive as other isotope labelling methods and must have 
a broadly similar dynamic range. la addition, the properties of flie tags must be consistent 
over the whole eTcpected dynamic range of the samples to be analysed. Finally, the ability 
of these tags to overcome noise in the mass spectrometer needed to be demonstrated. To 
test Ihe dyoamic range of the system and to show that tiie properties of the TMT tags are 
consistent over the entire dynamic range, the conservation of peptide ratios was examined 
at a range of different concentrations of one of the tagged synthetic peptides (peptide 3A 
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and 3B). As can be seen from Figure 25, a serial dilutian of peptides 3 A and 3B, mixed in 
a ratio of 40:60, from 100 pmoles to 100 finoles, fbe ratios were reliably conserved with a 
deviation within 5% in most cases, from the expected ratio. These and other results (not 
sho^) indicate that Hie tag p^tides do not reduce tiie intrinsic sensitivity with which a 
peptide is detected in the MS/MS mode, i.e. the analysis of IMT labelled peptides by CID 
has essentially the same sensitivity as the MS/MS of untagged peptides. The intrinsic 
sensitivity seems to be instrument specific based on con5)aiisQiis between the LCQ and 
QTOF in the analysis of small peptides (the tag fragments from large peptides labelled 
with TMTs cannot be detected on the LCQ because of the intrinsic limitations on CID 
with this type of instrument). The sensitivity with which it is posisible to determine the 
sequence of tagged peptides does not seem to be have been significantiy changed in any 
of the peptides tested so far. Meaningful differences in the ratios of the pq)tides can be 
detected over the entire range of concentrations tested (Figure 25). 

Figures 26a 26b and 26c show the results of a spiking experiment in which peptides pairs 
3A and 3B (500 finol in total, in a ratio of 40:60 respectively) bearing a second generation 
TMT was mixed with a tiyptic digest of Bovine Serum Albumin (2 pmol). Figure 26a 
shows the base peak chromatbgram from analysis in the MS-mode. During the run, the 
first five most intensive ions analysed in MS mode were automatically fi:^gmented in the 
MS/MS mode at 30V. The TMT peptides pairs were investigated and located on the base 
peak chromatogram- The ratio of the TMT2 firagments was then calculated from the 
MS/MS spectrum for the mass [M+3H] (a zoom of the tag firagments is shown in figure 
9b and the whole spectrum shown in figure 9c) by conrparing the intensity of the dO and 
d3 TMT fragment mass-to-charge ratios (287 and 290). 

In a further experiment, the ability to detect labelled peptides in a background of 
contaminating peptides was examined. The peptides pairs 3A and 3B bearing a second 
generation TMT was mixed with a 20-fold excess of a tryptic digest of Bovine Serum 
Albumin. The peptide mixture was then analysed m an LC-QTOF instrument The five 
most intense ions from each etution scan were subjected to CID to identify the peptides. 
The expected peptides were detected and the region of spectrum corresponding to the tag 
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jGcagments was analysed to detenxiine the abundance ratio of the detected peptides. 
Analysis by dD (collision energy of 30V), provides the spectrum shown in Figure 26c. 
The ratio of the peptides 3 A and 3B was found to be 39,3% to 60.7% respectively, by 
coirparison of the peak intensities at the fragment ion mass-to-charge ratios of 290 (d3 
TMT unit) and 287 (dO TMT unit). The expected ratio was 40% 3 A to 60% 3B, thus the 
peptide ratio was detected with a 1.7% error. The quality of the MS/MS spectrum 
obtained (Figure 26b and 26c) at the low collision energy used, allows a clear 
identification of the peptide sequence by database searching. This experiment clearly 
shows that a conc5)lex mixture of tryptic peptides does not hindCT the analysis of peptide 
pqirs labelled with the generation TMT tags and the TMTs can help to overcome noise 
in the sacq)^. In addition th^e do not seem to be any suppression problenis - ratios of 
peptides present in. low concentrations can still be determined in the preseiace of other 
peptides that are in high concentrations. 
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CLAIMS: 

1. A set of two or more mass labels, eacli label in tlie set coiipising a mass marker 
moiety attached via a cleavable linker having at least one amide bond to a mass 
normalisation moiety, wherein the aggregate mass of each label in the set may be the 
same or different and the naass of the mass mark^ moiety of each label in the set may be 
the same or different, and wherein in any group of labels within the set having a mass 
marker moiety of a common mass each label has an aggregate mass different from all 
other labels in that group, and wherein in any group of labels within the set having a 
common aggregate mass each label has a mass marker moiety having a mass different 
from that of all other mass marker moieties in that group, such that all of the mass labels 
in the set are distinguishable from each other by mass spectrometry, and wherein the n:iass 
marker moiety conopises an amino add and the mass normalisation moiety conpises an 
ansinoacid 

2. A set of mass labels according to claim 1, in which each label in the set conqxrises 
a mass marker moiety having a common naass and each label in the set has a unique 
aggregate mass. 

3. A set of mass labels according to any preceding claim, in which each label in the 
set comprises a mass marker moiety having a unique naass and each label in the set has a 
common aggregate mass. 

4. A set of mass labels according to claim 3, in which each mass n^rker moiety in 
the set has a common basic stmcture, and each mass normalisation moiety in the set has a 
common basic structure that may be the same or different from the common basic 
structure of the mass marker moieties, and wherein each mass label in the set conqpzises 
one or more mass adjuster moieties, the mass adjuster moieties being attached to or 
situated within the basic structure of the mass marker moiety and/or the basic structure of 
the mass normalisation moiety, such that every mass marker moiety in the set conopises a 
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different miniber of mass adjuster moieties and every mass label in the set has the same 
nmnber of mass adjuster moieties. 

5. A set of mass labels according to claim 4, each mass label in the set having the 
following structure: 

M(A)y.L-X(A)2 

wherein M is a mass normalisation moiety comprising an aiiiino acid, X is a inass marker 
moiety con:q>risiiig an anoino acid, A is a mass adjuster moiety, L is a cleavable linker 
comprising the amide bond, y and z are integers of 0 or greatar, and y+z is an integer of 1 
or greater. 

6. A set of mass labels according to claim 4 or claim S, wherein the mass adjuster 
moiety is selected from: 

(a) an isotopic substituent situated within the basic structure of the mass 
marker moiety and/or within the basic structure of the noiass normalisation moiety, and 

(b) substituent atoms or groins attached to the basic structure of Hie mass 
marker moiety and/or attached to the basic structure of the mass normalisation moiety, 

7. A set of mass labels according to claim 6, wherein the mass adjuster moiety is 

selected from a halogen atom substituent, a methyl group substituent, and or ^^C 
isotopic substituents. 

8. A set of mass labels according to claim 7, wherein the mass adjuster moiefy is a 
fluorine atom substituent. 

9. A set of mass labels according to any preceding claim, wherein the cleavable 
linker attaching the n:iass marker moiety to the mass nonnalisation moiety is a linker 
cleavable by collision induced dissociation. 
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10. A set of mass labels accordiiig to any preceding claim, wherein the cleavable 
Iroker con^)ris6s proline and/or aspartic add. 

11. A set of noass labels according to any preceding claim^ wherein the nGiass marker 
moiety and/or the mass normalisation moiety con^xrises a firagmentation resistant group. 

12. A set of mass labels according to any preceding claim, wherein the mass riaarker 
moiety coicprises a sensitivity enhancing group, such as a pre-ionised group. 

13. A set of mass labels according to any preceding claim, wherein tbie naass marker 
moiety or the mass normalisation moiety con^pises a reactive functionality. 

14. A set of mass labels according to any preceding claim, wherein each n:iass label in 
the set conqirises an afBnity capture ligand, such as biotin. 

15. A set of two or more analytes, each analyte in the set being different and being 
attached to a unique mass label or a xmigue conibiQation of mass labels, from a set of mass 
labels as defmed in any of claims 1-14. 

16. A set of analytes according to claim 15, wherein one or more analytes in the set is 
a standard analyte having a known mass, or known chromatographic properties. 

17. A set of two or more probes, each probe in the set being diflEerent and being 
attached to a unique mass label or a unique combination of mass labels, from a set of miass 
labels as defined in any of claims 1-14. 

18. A set of analytes or probes according to any of claims 15-17, wherein each analyte 
or probe is attached to a uxiique combination of mass labels, each combination being 
disltinguished by the presence and absence of each mass label in the set of noass labels 
and/or the quantity of each miass label attached to the probe. 
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19. A set of analytes or probes accordjBg to any of claixns 15-18, herein each analyte 
or probe coxiqnises a biomolecule. 

20. A set of analytes or probes aocx>rding to claim 19, wherein the biomolecule is 
selected from a DNA, an KNA^ an oligonucleotide, a nucleic add base, a protein and/or 
an amino acid. 

21. A method of analysis, which method cocoprises detecting an analyte by identifying 
by inass spectrometry a mass label or a combination of mass labels relatable to the 
analyte, wherein the mass label is a mass label from a set of mass labels as deiSned in any 
of claims 1-14. 

22. A method according to claim 21, wherein the mass labels en^loyed are labels 
conq)rising an afiBnity capture ligand, and labelled analytes are separated from unlabelled 
analytes by capturing the afGnity capture ligand with a counter ligand. 

23. A method according to claim 21 or claim 22, ia which t^o or more analytes are 
detected by simult&eously identifying their mass labels or cotnbinations of mass labels 
by mass spectrometry. 

24. A method according to any of claims 21-23, wherein each analyte is identified by a 
unique combination of mass labels from a set or array of mass labels, each combiaatioli 
beiag distinguished by the presence and absence of each mass label in tiiie set or array 
and/or the quantity of each mass label. 

25. - A method according to any of claims 21-24 for identifying two or more analytes, 
wherein the analytes are separated according to their mass, prior to detectmg their mass 
labels by mass spectrometry. 
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26. A mefhod according to claim 25, herein the analytes to be identified are nsixed 
witii one or more standard analytes, having known mass or known properties in the 
separation method used, to facilitate the characterisation of the analytes. 

27. A method according to claim 26, herein the standard analytes are as defined in 
claim 16. 

28. A method according to any of claims 25-27, wherein separation is carried out by a 
chromatographic or electrophoretic method. 

29. A method according to any of claims 21-28, wherein the mass spectrometer 
en^loyed to detect Ihe mass label comprises one or more mass analysers, which mass 
analyseirs are capable of allowing ions of a particular mass, or range of masses, to pass 
through for detection and/or are capable of causing ions to dissociate. 

30. A method according to claim 29, wherein ions of a particular mass or range of 
masses specific to one or more known noass labels are selected using the n:iass analyser, 
the selected ions are dissociated, and the dissociation products are detected to identify ion 
patterns indicative of the selected mass labels. 

31. A method according to claim 29 or claim 30, wherein the mass spectrometer 
comprises three quadrupole mass analysers. 

32. A method according to claim 31, wherein a first mass analyser is used to select 
ions of a particular mass or mass range, a second n:ia5S analyser is used to dissociate the 
selected ions, and a third mass analyser is used to detect resulting ions. 

33. A method according to any of claims 21-32, which mefhod comprises: 

(a) contacting one or more analytes with a set of probes, wherein the probes 
are as defined in any of claims 17-20, 

(b) identifying an anal3rte, by detecting a probe relatable to that analyte. 
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34. A method according to claim 33, wherein the tnass label is cleaved from the probe 
prior to detecting the tnass label by mass spectromelxy. 

35. A method according to claim 33 or claim 34, which method conpises contacting 
one or more nucleic acids with a set of hybridisation probes. 

36. Use of a mass label from a set of labels as defined in any of claims 1-14, in a 
method of analysis. 

37. Use according to claim 36 in a method of 2-dim^ional electrophoretic analysis. 

38. Use according to claim 36 in a method of 2-dimetisional mass spectrometric 
analysis. 

39. Use according to any of claims 36-38 in a method of sequencing one or more 
nucleic adds. 

40. Use according to any of claims 36-3 8 in a method of gene expression profiling. 

41 . Use accordiag to any of claims 36-3 8 in a method of protein expression profiling. 

42. Use according to any of claims 36-38 in a method of nucleic acid sorting. 
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Figure 4 
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Figure 6 
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Figure 10 
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Figure 13 
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Fig. 19b 
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MS-mode analysis of the TMT1 and TMT2 peptide mixture: 
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CID MS/MS-inode: 
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Fig. 21 
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